How 6 LLMs answer data science interview question

A few weeks ago I wrote about my favorite data science interview question. Since so many folks are all excited about AI these days, I thought it would be fun to see how well some popular Large Language Models (LLMs) would answer my question.
Today's LLMs are not data scientists
LLMs have the potential to transform fields like engineering and data science, but are unlikely to replace data scientists entirely. Instead, they are reshaping their roles and responsibilities by automating routine tasks and enabling a focus on higher-level strategic activities.
As a David Letterman fan, here are my Top 10 Limitations of today's LLMs compared to data scientists:
- Lack of True Understanding
- Hallucinations and Inaccuracies
- Dependence on Quality Data
- Inability to Handle Non-Textual Data
- Ethical and Privacy Concerns
- Lack of Long-Term Memory
- Complexity and Interpretability
- Selling Ideas to Stakeholders
- Running and Owning Experiments
- Don't Offer Accountability
The prompt:
I am interviewing candidates for a senior data science position at Rent the Runway (clothing rental). I am going to have a 30 minute discussion around the following question “what clothes should Rent the Runway buy for the upcoming season and at what price?”. I want you to consider that we need to support both single event (4 day) rental use cases and subscription use cases.
In order to get a more suitable response, I often had to follow on with prompts with more context such as:
This is pretty high level, can you please be more detailed. This would not be an answer I would expect from a senior data scientist.
Criteria for evaluating responses:
As part of my blog post my favorite data science interview question, I outlined the following areas for evaluating responses:
- Discussions about our Data
- Discussions around Demand
- Discussions around Unit Economics
- Discussions around Operational Concerns
- General Interview Aspects
The models:
Click on each logo below for model details and the model's response:
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
ChatGPT:
ChatGPT is a large language model developed by OpenAI. It is based on the GPT-3 architecture and has been fine-tuned on a variety of conversational data.
ChatGPT's response:
Step 1: Understanding Customer Segments and Use CasesBefore diving into data, the first thing a senior data scientist would do is clearly differentiate between the two primary customer segments: event rentals (4-day rentals) and subscriptions. The candidate should acknowledge that each has distinct demands.
Step 2: Data-Driven Insights for Item SelectionHistorical Data Analysis & Forecasting A senior data scientist would start by accessing past rental data to identify patterns and trends. This includes analyzing:
Step 3: Predicting Trends and IdentifyingPopular Categories for Upcoming Season
Step 4: Pricing Strategy & Optimization
Step 5: Operational and Inventory ConsiderationsInventory Management Optimization: Forecasting Demand per Size/Style/Category: To avoid overstocking or stockouts, use demand forecasting models (e.g., Poisson regression) to predict demand per size/style and ensure that the inventory is stocked accordingly.
Replenishment Strategy: Based on the forecasted demand, set automated replenishment triggers for high-demand items (e.g., reorder a popular dress or size once it reaches 80% utilization). Step 6: Presenting Results and Making RecommendationsFinally, a senior data scientist would present the findings with actionable insights:
ConclusionThis approach integrates advanced statistical modeling, machine learning techniques, forecasting, and operational optimization strategies, reflecting the depth and rigor expected from a senior data scientist. |
My thoughts about ChatGPT's response:
- I expect more detail in the response for the data that candidates would be interested in using for exploring / modeling activities.
- I expect some focus around thinking about overall demand for a season's buy. It was interesting to see how the response focused more on thoughts around specifics user cohorts and types of products.
- It didn't address the "what price" aspect of the question very well. I expect some kind of basic explanation around unit economics.
- I was very surprised to see the response include python psuedo-code snippets. I did not find it especially helpful and I would not expect a senior data scientist to be writing code for a 30 minute presentation.
- I felt like much of this response was based on some incorrect assumptions about how the business operates. It leaves me curious about the data set OpenAI used to train this model.
Claude Sonet 3.5:
Claude is a chatbot and family of large language models (LLMs) that uses artificial intelligence (AI) to generate text and have conversations with users. It was developed by Anthropic, a research firm that aims to make AI more helpful, harmless, and honest.
In general, I find Claude to be the chatbot I find myself most regularly using these days. I feel like Sonet 3.5 has the best support coding assistance, especially when running through an AI coding tool like Cursor.
Claude's response:
Thank you for the question. Let me approach this systematically: First, I'd like to clarify that we need to make two key decisions: inventory selection and pricing strategy. For this rental model, we need to optimize both the initial purchase price and the rental price to maximize ROI while maintaining customer satisfaction. Let me outline my analytical approach: 1. Customer Segmentation AnalysisI would first segment our customers based on:
Using k-means clustering on these variables, let's say we identify four primary segments:
2. Inventory AnalysisFor each segment, I would analyze:
Key metrics I'd calculate:
3. Price Optimization ModelI would build a mixed integer programming model with:
Objective Function: Maximize: Σ(Expected rental revenue - Purchase costs - Operating costs) 4. Specific RecommendationsBased on this analysis, here's what I would recommend: Purchase Strategy:
Pricing Strategy:
Implementation:
|
My thoughts about Claude's response:
Claude jumped into describing some more advanced concepts suggested here compared to some of the other LLMs:
- I like the idea of segmenting customers with some potential features. Clustering to identify user groups is an approach our data team took.
- Key metrics identified are all things we used.
- Pricing optimization was too high level.
- It is interesting the model came up with purchase strategy. I have no idea what assumptions this is based off of
- Implementation approach wouldn't work because it doesn't consider that:
- Purchase cycles are 6 months before the season
- A/B test pricing strategies sounds good, but you will hear backlash on social media when folks see different prices for the same item. And you have to be careful with potential price discrimination.
- For a subscription model, you can't adjust pricing monthly. You have to grandfather in existing customers.
Deepseek R1:
DeepSeek-R1 is a direct competitor to OpenAI's o1. One difference between V3 and R1 is that when chatting with R1, we don't immediately get a response. The model first uses chain-of-thought reasoning to think about the problem. Only once it finishes thinking it starts outputting the answer.
Deepseek's response:
Okay, so I need to figure out what clothes Rent the Runway should buy for the upcoming season and at what price. The interview question is for a senior data science position, so they probably expect a structured, data-driven approach. Let me start breaking this down. First, Rent the Runway has two main use cases: single-event (4-day) rentals and subscriptions. I need to consider both. The goal is to determine the optimal inventory and pricing strategy. I should start by thinking about the data sources available. Maybe historical rental data, customer preferences, past sales trends, inventory turnover rates, customer feedback, maybe even external data like fashion trends or economic factors. But since I don't have access to actual data, I'll have to make assumptions based on typical data science approaches. For the subscription model, customers probably want a variety of options to keep their wardrobe fresh. They might prioritize versatility and current trends. Single-event renters might be looking for specific items for occasions like weddings, parties, etc. So the inventory needs might differ. Subscription items might be more about everyday wear or frequent use, while single-event could be more about statement pieces or formal wear. Pricing strategies for each model might vary. Subscriptions are recurring revenue, so maybe lower per-item pricing but higher lifetime value. Single-event could be priced higher per rental but with less frequency. Need to balance inventory allocation between the two to maximize revenue without stockouts or overstock. Next, think about factors influencing what to buy. Historical data on which items were popular in past seasons, maybe by region, season, event type. Also, customer demographics: age, location, preferences. For example, winter might require more coats, holiday season more party dresses. Also, considering sustainability—maybe items that can be rented multiple times without wear and tear. Pricing could depend on demand elasticity. High-demand items can be priced higher. Maybe dynamic pricing based on availability and demand. Also, cost of acquisition, maintenance, cleaning, shipping. Need to ensure that the rental price covers these costs and provides a profit margin. Machine learning models could help predict demand for different items. Time series analysis for seasonal trends, classification models to categorize items into high/low demand, clustering to identify customer segments. Maybe A/B testing different pricing strategies. For subscriptions, understanding the churn rate and how inventory affects retention. If subscribers don't get enough variety or desired items, they might cancel. So inventory needs to align with subscriber preferences and usage patterns. Inventory turnover is crucial. Items that rent quickly and frequently are better. Maybe prioritize items with high turnover rates for subscriptions to keep the inventory fresh. For single-event, items with lower turnover but higher per-rental profit might be acceptable. Competitor analysis could also play a role. What are other rental services offering? How does Rent the Runway differentiate? Maybe through exclusive partnerships with designers or offering premium items. Another angle: customer feedback and reviews. Analyzing reviews to see what customers liked or didn't like about past rentals. Sentiment analysis on reviews could highlight desirable features (e.g., comfort, style, fit). Collaborative filtering or recommendation systems could suggest items similar to popular ones. Maybe identifying gaps in the inventory where demand isn't being met. For pricing, maybe a combination of cost-plus and value-based pricing. Also, considering the rental frequency—if an item is rented multiple times, the cost per rental decreases, allowing for competitive pricing. Need to balance the inventory between trendy, fast-fashion items and timeless classics. Trendy items might have high short-term demand but become obsolete quickly. Classics have longer shelf life but might not command high prices. Data on returns and damages: items that are frequently damaged might not be cost-effective. Need to factor in maintenance costs when deciding to purchase. Also, considering size inclusivity. Ensuring a range of sizes could attract a broader customer base and increase rental frequency. Possible steps:
Potential challenges: Balancing stock levels to avoid stockouts (lost revenue) and overstock (increased costs). Ensuring the pricing strategy doesn't alienate customers but remains profitable. Adapting to sudden changes in fashion trends. Metrics to track: Rental frequency, customer acquisition/retention rates, profit per item, inventory turnover, customer satisfaction scores. I need to present this in a structured way during the interview. Start by outlining the problem, then discuss data sources, analytical approaches, models, and considerations for both inventory and pricing. Emphasize the balance between subscription and single-use cases, and how data-driven decisions can optimize both. |
To determine what clothes Rent the Runway should buy for the upcoming season and at what price, we need a data-driven approach that balances the needs of both single-event (4-day) renters and subscription customers. Here's a structured strategy:
Example Recommendation
This approach balances revenue streams, minimizes stockouts, and aligns with customer expectations for both use cases. |
My thoughts about Deepseek's response:
The Deepseek response thought for a little bit and generated a response in 39 seconds. I really like the content it generated in italics that read more like a conversation. I find this type of content and tone more valuable than the bulleted style lists that most LLMs respond with them.
Gemini:
Google Gemini is an AI-powered assistant that helps users with writing, learning, planning, and more. It's available on Google Cloud, Google Workspace, and as a mobile app.
Gemini's response:
Here's a breakdown of how a strong candidate might approach the "what clothes should Rent the Runway buy for the upcoming season and at what price?" question, considering both single event (4-day) rentals and subscription use cases. This framework emphasizes a structured, data-driven approach. I. Understanding the Business Objectives (5 minutes)A good candidate will start by clarifying the goals. They shouldn't just jump into modeling. They should ask questions like:
II. Data Analysis and Feature Engineering (10 minutes)This is where the candidate demonstrates their data science skills. They should discuss the data they'd need and how they'd analyze it:
III. Model Building and Evaluation (7 minutes)The candidate should discuss the modeling approach:
IV. Recommendation and Implementation (8 minutes)The candidate should tie their analysis back to concrete recommendations:
Key Takeaways for the Interviewer:
|
My thoughts about Gemini's response:
Things I like:
- It provides a timebox around each section, to make sure the candidate is moving through the problem at a good pace.
- It starts with business metrics, posed as questions. This is a good approach to make sure the candidate is thinking about the business problem.
Things I like less:
- There is no way we are going to get through this much content in 30-45 minutes. Assuming this will be an actual conversation and I would ask questions as we go along, maybe we would get through half of this.
- While I can see the desire to propose potential models for specific use cases, I think the candidate likely would not have enough context about the problem space and the business complexities.
Perplexity:
Perplexity is a free AI search engine that provides answers to user questions in real time. It's designed to be a research partner that helps people learn information in their own way.
I find myself using perplexity in many cases where I had previously used Google or Stack Overflow to search for answers. I like that it provides references to its answers so I can dig deeper and validate.
Perplexity's response:
Got it! Here's how a senior data scientist candidate might respond to the question, “What clothes should Rent the Runway buy for the upcoming season and at what price?” in a detailed, technical, and data-driven way: Answer from a CandidateTo determine what clothes Rent the Runway (RTR) should buy for the upcoming season and at what price, I would approach this problem in three stages: data exploration and modeling, inventory optimization, and pricing strategy. Here's my step-by-step approach: 1. Data Exploration and Demand ForecastingThe first step is to forecast demand for different types of clothing based on historical data. I would:
For example:
2. Inventory OptimizationOnce we have demand forecasts, the next step is to optimize inventory selection to balance customer satisfaction with operational efficiency.
3. Pricing StrategyPricing must balance profitability with customer satisfaction across both use cases.
4. Continuous ImprovementFinally, I'd recommend setting up a feedback loop:
Example Output If I were making specific recommendations based on this framework:
This approach ensures we maximize revenue while meeting diverse customer needs across both use cases. This answer demonstrates technical depth while tying it back to actionable insights tailored to Rent the Runway's business model. |
My thoughts about Perplexity's response:
I find it useful that Perplexity offers some useful follow up questions, some similar to what I found candidates asking in the past.
I think this answer is pretty good, a lot of useful concepts. I would have liked to seen some discussion around unit econonmics more detailed than the simple "Calculate cost-per-rental" formula.
OpenAI Deep Research:
OpenAI Deep Research is an agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you. Subscription costs $200 per month to ask 100 questions, so this is pretty pricey.
One of my friends bought a month's subscription of this new hotness (Deep Research launched a few weeks ago) and was nice enough to run this question for me. I heard about all the hype and was interested to see what it comes up with.
Deep Research's response:
Data-Driven Inventory and Pricing Strategy for Rent the RunwayRent the Runway (RTR) should integrate data analytics with strategic business insights to decide which clothes to buy and at what price. This involves a holistic approach that balances demand forecasting, pricing optimization, sustainability, designer partnerships, operations, customer segmentation, and competitive trends. Below is a structured strategy addressing each key factor: Customer Demand ForecastingUse data-driven forecasting to predict what styles, sizes, and designers will be most popular. RTR has a wealth of historical rental data to inform these decisions. By applying machine learning to past rentals and browsing behavior, RTR can identify trends and seasonality in demand and even predict customer preferences for new styles (D3.HARVARD.EDU). Key actions include:
By forecasting demand with precision, RTR can buy the right styles in the right quantities, minimizing overstock of low-demand items and maximizing availability of favorites. Pricing OptimizationDetermine the ideal rental pricing and purchase cost for each garment using data science models. Each item in RTR's inventory is an asset that generates revenue through rentals over time and then may be sold on the resale market. Optimizing pricing means maximizing the lifetime value of that asset relative to its cost. Considerations for pricing include:
Using these factors, RTR can set a target buy price and rental rate for each item. For instance, if a dress is expected to be popular (rent often) and hold value, RTR might pay a premium to acquire it and set a moderate rental price to encourage volume. Conversely, for a trendy piece that might go out of style quickly, RTR should negotiate a lower buy cost or shorter rental window to mitigate risk. Data scientists at RTR focus on profit modeling and pricing strategy for inventory to hit the sweet spot in price and demand (RESOURCES.CDO.MIT.EDU). The end result is a pricing strategy that maximizes revenue per item while delivering value to customers. Sustainability ConsiderationsInventory decisions should align with RTR's brand as a leader in sustainable, “closet in the cloud” fashion. This means prioritizing durable, high-quality garments and considering environmental impact in purchasing:
By baking sustainability into inventory decisions, RTR not only controls long-term costs (through durability and fewer replacements) but also strengthens its brand reputation. Customers who care about sustainability will appreciate seeing high-quality, eco-conscious garments available, creating a virtuous cycle of loyalty and positive brand image. Designer PartnershipsLeverage strong relationships with designers and brands to acquire the right inventory at favorable economics. RTR should act as a strategic partner to designers, using its platform and data to create win-win deals:
By working closely with 750+ designer partners (RENTTHERUNWAY.COM), Rent the Runway can ensure a pipeline of fresh, high-ROI inventory. The strategy is to procure must-have fashions in a cost-effective way, whether by co-designing unique items or structuring deals that reduce upfront costs. Strong partnerships backed by data (which RTR can share with designers to inform their future collections) create a competitive advantage that's hard to copy. Logistics & Operational ConstraintsAny inventory decision must be feasible to manage operationally. RTR's business isn't just buying clothes - it's about storing, cleaning, and shipping them efficiently in a continuous cycle. Key operational factors to consider when deciding what to buy:
In short, practical execution matters. Even the most fashionable dress is a bad investment if it's too costly or slow to handle in the rental cycle. RTR should thus choose inventory that fits within its logistical capabilities, and continually invest in operations (techniques like just-in-time processing, automated sorting, etc.) to expand those capabilities (ZORP.ONE) (ZORP.ONE). Aligning inventory strategy with operational capacity ensures high utilization of each item and a smooth experience for customers. Customer Segmentation & PersonalizationRent the Runway serves a diverse customer base with different tastes and occasions – from young professionals needing work attire to bridesmaids seeking dresses for a wedding. Making smart buying decisions means curating inventory for key customer segments and personalizing the experience:
In essence, know your customer segments intimately and buy accordingly. RTR's competitive edge is bolstered by the richness of its customer data - it knows what a 30-year-old fashionista in NYC likes versus a 40-year-old professional in Chicago. By using these insights, RTR stocks the closet in a way that feels personalized to each member, increasing usage and satisfaction across the board. Competitive BenchmarkingFinally, RTR should continuously scan the competitive landscape and broader market trends to inform its inventory strategy. The fashion rental space has grown and evolved, with multiple players and shifting consumer expectations. Key points here include:
By staying vigilant about industry movements, RTR can adapt its inventory purchases proactively rather than reactively. This external lens, combined with internal data, ensures RTR's strategy isn't formed in a vacuum. It will help answer questions like: Are there must-have designers or collections that everyone is carrying this season? Is there a trend in rental (or resale) that RTR needs to invest in (such as more casual everyday wear as opposed to occasion wear)? Continual benchmarking guarantees that RTR offers a compelling, up-to-date selection and remains one step ahead of customer expectations. ConclusionIn conclusion, Rent the Runway should make inventory purchase and pricing decisions through a blend of data-driven analysis and strategic foresight. By forecasting customer demand with precision, optimizing pricing for lifetime value, upholding sustainability and quality, partnering smartly with designers, aligning with operational capacities, catering to key customer segments, and keeping an eye on the market, RTR can curate a rental collection that delights customers and drives profitable growth. This holistic approach ensures that every item RTR buys earns its keep - attracting renters, justifying its cost through multiple wears, and reinforcing Rent the Runway's position as the premier “closet in the cloud.” Each decision, backed by data and strategic thinking, will help RTR continue innovating in the fashion rental industry while moving toward sustainable profitability. |
My thoughts about Deep Research's response:
The response took about 3 minutes to compile, which was faster than I expected. I think this response was the most in depth of the response and covered the widest variety of subject areas. I liked that it was able to pull in a lot of sources and cite them (similar to perplexity).
I don't think I would pay $200 for the answer provided!
Conclusions
Overall, I think many of the responses were pretty good. I think LLMs provide a solid starting point for researching a new subject area. They brought up some not obvious considerations that many folks might not think of.
I would NOT consider today's LLM offerings capable of replacing a good data scientist!!! No matter how much some LinkedIn influencers and tech blogs try to convince you that LLMs can be a cost efficient alternative.