- Introduction
- Research Questions
- Project Overview
- Data Resources
- Conclusions and Insights
- Contributors/Team
- License
In this project, we analyzed discussions from the Entry-Level Luxury Car Forum. Utilizing data scraping and Lift Analysis techniques, we discerned prevalent brand comparisons and the attributes being discussed. Furthermore, we evaluated consumer aspirations by analyzing brand mentions associated with aspiration-indicative words.
- Develop a Python scraper to retrieve messages from Edmunds.com discussion forums. The output should be a .csv file containing columns: date and message. Study the forum's HTML structure and threading patterns before creating the scraper.
- Extract approximately 5000 car-related posts from the Entry Level Luxury forum.
- Test the data's adherence to Zipf’s law econometrically and plot the 100 most common words against the law's theoretical prediction. Do not remove stopwords or perform stemming/lemmatization.
- Identify the top 10 brands from frequency counts. Create a script to count word frequencies (excluding stopwords) and replace common car models with brand names.
- Calculate lift ratios for associations between the top-10 brands from Task A. Develop a script for this task, ensuring a message is not counted more than once per post.
- Illustrate the brands on a multi-dimensional scaling (MDS) map using a Python script.
- Offer insights to your client based on analyses in Tasks C and D.
- Identify the 5 most frequently mentioned car attributes or features in the discussions and their strong associations with the top 5 brands.
- Provide strategic advice to your client based on findings from Task F.
- Determine the most aspirational brand in your data and discuss the business implications for this brand.
During the initial phase of the project, we focused on gathering substantial data from the Entry-Level Luxury Car Forum. This data served as a rich resource for understanding consumer preferences and discussions revolving around various car brands and their attributes.
In Task B, we conducted a frequency analysis to identify the top mentioned brands and common terms in the discussions. The analysis revealed the following brands as the most frequently mentioned:
- BMW: 2003 mentions
- Audi: 1467 mentions
- Acura: 1312 mentions
- Honda: 949 mentions
- Seat: 594 mentions
- Volkswagen: 469 mentions
- Sedan: 424 mentions
- Infiniti: 395 mentions
- Cadillac: 379 mentions
- Hyundai: 358 mentions
- Toyota: 345 mentions
- Problem: 309 mentions
- Ford: 275 mentions
Following Task B, in Task C, we utilized the data to calculate lift ratios between the top-mentioned brands. This involved creating a heatmap visualization that clearly displayed the lift scores between these brands, providing a graphical representation of the associations and relationships between them. Here is the heatmap visualization:
In Task D, we further analyzed the data through a multi-dimensional scaling (MDS) plot, which showcased two distinct clusters: luxury and non-luxury car brands. This visualization helped in delineating the perceptual differences between the brands, offering a clear picture of the market dynamics and consumer preferences. Here is the MDS plot:
Analysis Type | Insight Category | Description |
---|---|---|
MDS Coordinates | Cluster of Luxury Brands | Brands like 'bmw', 'audi', and 'infiniti' are grouped closely together, suggesting they are perceived similarly, likely as luxury or high-end brands. |
Mainstream vs. Luxury | 'honda' and 'toyota' are distanced from luxury brands like 'bmw', indicating a differentiation in the market perception between mainstream and luxury brands. | |
Isolated Brands | 'ford' and 'honda' appear as outliers, suggesting they are unique in certain aspects compared to other brands. | |
Lift Values | Affinity Between Luxury Brands | High lift values for combinations like ('audi', 'infiniti') and ('bmw', 'audi') show that these luxury brands often appear together, suggesting a strong affinity among consumers for these brands. |
Mainstream Affinities | High lift values between ('honda', 'toyota') and ('hyundai', 'toyota') suggest that these mainstream brands share a lot of similarities in consumer perception. | |
Mixed Affinities | Brands like 'audi' and 'volkswagen' have a significant lift value, indicating that consumers often consider these brands together, despite one being a luxury brand and the other more mainstream. | |
Integrated Insights | Cross-Segment Appeal | 'audi' seems to have a broader appeal, as indicated by high lift values with both luxury ('bmw', 'infiniti') and non-luxury ('volkswagen') brands. This could be a unique selling proposition. |
Potential Partnerships | High lift values between brands like ('audi', 'infiniti') and ('honda', 'toyota') could imply beneficial partnerships or co-branding opportunities. | |
Targeting Strategy | If the client is, for example, 'audi', they could focus marketing strategies on consumers who also show interest in 'bmw' or 'infiniti' given their close proximity in the MDS plot and high lift values. | |
Competitive Positioning | Brands like 'ford' that are isolated in the MDS plot but have lower lift values with other brands may need to re-evaluate their market positioning. |
Analysis Type | Insight Category | Relevancy with Current Real-World Market Perception |
---|---|---|
MDS Coordinates | Luxury Cluster | Brands like BMW, Audi, and Infiniti are generally considered luxury brands. Their close proximity in the MDS plot aligns well with market perceptions. |
Mainstream Brands | Honda and Toyota are perceived as mainstream and reliable brands. Their distinct positioning in the MDS plot is consistent with this perception. | |
Isolated Brands | Ford has a diverse product line ranging from trucks to sedans to electric vehicles. Its unique positioning in the MDS plot could reflect this diversity. | |
Lift Values | High Affinity Among Luxury Brands | High lift values among Audi, BMW, and Infiniti suggest that consumers interested in one luxury brand are likely to consider others. This aligns with the general perception that these brands compete in the same luxury market segment. |
Mainstream Affinities | High lift values for Honda-Toyota and Hyundai-Toyota pairs are consistent with the market segment that values reliability and cost-effectiveness. These brands are often compared and considered together by consumers. | |
Cross-Segment Appeal | Audi has high lift values with both luxury (BMW, Infiniti) and mainstream (Volkswagen) brands. This may reflect Audi's broader market appeal and is consistent with its positioning as a luxury brand that offers some more accessible models. | |
Integrated Insights | Partnerships and Co-branding | The high lift values among certain brand pairs suggest potential for partnerships or co-branding opportunities. This is not uncommon in the automotive industry where technology and platforms are often shared. |
Competitive Positioning | Ford's unique positioning could be both an opportunity and a challenge. The brand has been investing in electric and autonomous vehicles, which could explain its distinct market perception and offer avenues for differentiation. |
During our analysis in Task F, we identified the top attributes that were most frequently mentioned in the discussions. These attributes give us an insight into what features or aspects are most discussed or considered by consumers when discussing entry-level luxury cars. The top attributes mentioned are as follows:
-
Engine - This attribute being the most mentioned indicates that the engine specifications and performance are a significant topic of discussion and consideration among consumers.
-
Sport - This suggests that the sportiness or the sport features of the cars are a popular topic of discussion, possibly indicating a preference or a trend in the market towards sportier vehicles.
-
Driving - This attribute being frequently mentioned indicates that the driving experience, which includes aspects like handling, comfort, and driving pleasure, are significant considerations for consumers.
-
Miles - This attribute being mentioned frequently suggests that mileage or fuel efficiency is a significant consideration for consumers, possibly indicating a preference for cars that offer better mileage.
-
Dealer - This being a top attribute indicates that discussions often involve dealer experiences, which could include topics like customer service, dealer policies, and purchasing experiences.
These insights can be instrumental in understanding consumer preferences and trends in the entry-level luxury car market.
To visualize the associations, we plotted a heatmap that clearly illustrates the lift scores between the brands and these attributes. This heatmap serves as a graphical representation of how strongly these attributes are associated with each brand, providing insights into the perception and discussion trends surrounding each brand in the forum.
Here is the heatmap visualization illustrating the lift scores between the brands and the top attributes:
In Task G, we synthesize the insights derived from the lift analysis conducted in Task F to formulate strategic advice for the top brands in the market. The advice is based on the lift scores between the brands and the top attributes discussed in the forum. Here, we present a table that outlines the strategic advice for each brand based on their association with the top attributes:
Brand | Attribute | Lift Score | Strategic Advice |
---|---|---|---|
BMW | Sport | 1.407 | Leverage the brand's strong association with "sportiness" in marketing campaigns. |
Audi | Dealer | 1.395 | Emphasize the quality of Audi dealerships and customer service to attract more buyers. |
Toyota | Sport | 1.244 | Explore options to market sportier models or features, given the unexpected strong lift score with "sport". |
Audi | Sport | 1.165 | Consider co-marketing sporty models alongside your dealer network. |
BMW | Dealer | 1.161 | Enhance the customer experience at dealerships to solidify the positive association. |
Acura | Dealer | 1.074 | Improve dealership experiences as this attribute appears to be a strong point for the brand. |
Infiniti | Sport | 1.042 | Utilize the sporty image in marketing campaigns to attract a younger or more dynamic audience. |
Honda | Sport | 0.999 | Investigate why the brand nearly has a neutral association with "sport" and consider if repositioning is needed. |
Ford | Dealer | 0.643 | Focus on improving the customer experience at dealerships to boost this low lift score. |
Cadillac | Engine | 0.135 | Urgently address the negative perception around the engine quality or performance in your vehicles. |
This table serves as a strategic guide for brands to align their marketing and operational strategies with the perceptions and discussions prevalent among consumers in the entry-level luxury car market. It provides a roadmap for brands to leverage their strengths and address the areas where they have room for improvement, based on consumer discussions and perceptions.
In Task H, our objective was to identify the most aspirational brand in the dataset. Initially, we calculated the lift scores between various brands and the attribute "aspiration". However, the lift scores did not provide a clear indication of the relationship between the brands and the level of aspiration associated with them. Here are the lift scores we obtained:
Brand | Attribute | Lift Score |
---|---|---|
BMW | Aspiration | 0.753474 |
Acura | Aspiration | 0.703555 |
Audi | Aspiration | 0.687380 |
Infiniti | Aspiration | 0.656413 |
Honda | Aspiration | 0.608119 |
Cadillac | Aspiration | 0.579710 |
Ford | Aspiration | 0.536327 |
Toyota | Aspiration | 0.525829 |
Volkswagen | Aspiration | 0.459367 |
Hyundai | Aspiration | 0.446224 |
Given that the lift values did not provide a clear insight, we decided to consider the number of posts mentioning "aspiration" alongside the respective brands as a metric to gauge the level of aspiration associated with each brand. Here are the results:
Brand | Attribute | # of Posts |
---|---|---|
BMW | Aspiration | 126 |
Audi | Aspiration | 78 |
Acura | Aspiration | 66 |
Honda | Aspiration | 45 |
Infiniti | Aspiration | 24 |
Cadillac | Aspiration | 19 |
Volkswagen | Aspiration | 18 |
Toyota | Aspiration | 18 |
Ford | Aspiration | 15 |
Hyundai | Aspiration | 13 |
From this analysis, it is evident that BMW is perceived as the most aspirational brand, followed by Audi and Acura. This information can be instrumental in shaping brand strategies, especially in terms of positioning and targeting in the market.
The following data files are essential for the analysis:
- DiscussionData.csv: This file has the data we scraped from the forum for our analysis
- car_models_and_brands.csv: This file helps in identifying brands even when only car models are mentioned in the discussions
The data analysis reveals a distinct separation between luxury and mainstream brands. Luxury brands predominantly excel in aspects like aspiration and sportiness, whereas mainstream brands have a broader attribute spectrum. This presents an opportunity for brands to accentuate their strengths and address weaknesses. For instance, Cadillac could enhance their engine quality perception, while Audi could strategize to reach a wider consumer base. Brands might also consider forming partnerships based on high lift values to explore complementary market segments.
- Manideep Telukuntla
- Karthick Vel Kathirvel
- Spandan Pal
- Sharan Arora
- Pritesh Singh
- Raghav Vaidya
Licensed under MIT License