A Feature-Driven Machine Learning Approach For Netflix Content Prediction and Caching

Research output: Contribution to conferencePaperResearchpeer-review

Abstract

Understanding content popularity is critical for streaming platforms like Netflix to optimize user engagement and predict future trends. This paper explores feature analysis for analyzing Netflix content popularity, focusing primarily on hours viewed as a core metric. Using machine learning models, such as XGBoost, RandomForest, and hybrid approaches, this study examines the interplay of content attributes, viewer preferences, and temporal trends. In addition, a topic modeling approach based on series plots is implemented to identify latent themes and patterns that influence content popularity over time. The proposed framework integrates data preprocessing, model training, hybrid methodologies, and thematic analysis to reveal deeper insights into engagement dynamics. The outcomes demonstrate the effectiveness of selecting the right features with simpler models like Logistic Regression to obtain an accuracy of 82% on the Netflix engagement dataset and provide actionable strategies for enhancing recommendation systems, tailoring content delivery, and aligning offerings to meet evolving viewer preferences.
Original languageEnglish
Publication date2025
Number of pages7
Publication statusPublished - 2025
EventThe 16th International IEEE Conference On Computing, Communication And
Networking Technologies
- Indore, India
Duration: 6 Jul 202511 Jul 2025
Conference number: 16
https://16icccnt.com/

Conference

ConferenceThe 16th International IEEE Conference On Computing, Communication And
Networking Technologies
Number16
Country/TerritoryIndia
CityIndore
Period06/07/202511/07/2025
Internet address

Cite this