top of page

Create Your First Project

Start adding your projects to your portfolio. Click on "Manage Projects" to get started

Customer Review Sentiment Analysis and Product Recommendation Insights

Date

April 2023

Coding Assignment

This project uses Python and various data science techniques to analyze customer reviews for understanding sentiment, reviewer patterns, and product preferences. Key tools include NetworkX for visualizing product connections, NLTK for sentiment analysis, and KMeans for clustering sentiment patterns.

Google colab

Problem Statement

Objective: “To analyze customer reviews for sentiment, product rating trends, and reviewer behavior to generate insights into customer satisfaction and recommend popular products. The analysis used graph visualization and sentiment analysis to identify review patterns.”
Data Collection and Methodology

Data Source: Customer reviews were sourced from a JSON dataset, containing details such as product IDs, ratings, review text, and reviewer IDs.
Data Processing:
Parsed JSON data to extract review information.
Processed text data for sentiment analysis using NLTK’s Vader sentiment analyzer.
Used KMeans clustering to identify patterns in sentiment and review count.
Key Analyses and Visualizations

Product Review Network:

Visualization: A network graph displaying connections between products frequently reviewed by the same customers.
Insight: Reveals relationships among popular products, offering insights into co-purchase behaviors.
Sentiment Analysis:

Method: Used Vader to determine positive, negative, or neutral sentiment in each review.
Result: Quantified sentiment scores to assess overall customer satisfaction, identifying products with the highest and lowest sentiment.
Clustering by Sentiment and Review Count:

Method: KMeans clustering grouped products based on average sentiment and review volume.
Insight: This analysis highlights popular products and those with positive or negative feedback trends, helping to identify product strengths and areas for improvement.
Technical Challenges and Solutions

Handling Large Text Data: Preprocessed review text for efficient sentiment analysis.
Visualizing Relationships: Employed NetworkX for constructing and simplifying complex product-reviewer graphs.
Clustering Tuning: Adjusted KMeans parameters for meaningful clustering of sentiment and volume.
Tools and Technologies Used

Languages & Libraries: Python, NetworkX (for graph analysis), NLTK (for sentiment analysis), Matplotlib (for plotting), KMeans (for clustering).
Platform: Google Colab for data processing and visualization.
Learning Outcomes

Gained experience in sentiment analysis and clustering techniques.
Developed proficiency in graph visualization to illustrate relationships in customer behavior data.
Enhanced my understanding of data preprocessing for large text-based datasets.

+972-54-994-5850

+1-202-643-9308

  • LinkedIn
  • Instagram
bottom of page