Airbnb-Specific Data Patterns Questions
Domain-specific data modeling and analytics patterns used in Airbnb-scale product analytics. Covers data schema design, event and transaction patterns, feature engineering templates for predictive models, cohort and lifecycle analytics, geospatial and temporal data patterns, price and demand forecasting signals, AB testing data patterns, and data quality, governance, and lineage considerations relevant to Airbnb data.
HardTechnical
0 practiced
Design a statistical approach and data pipeline to compute fair exposure metrics and treatment effects when experiments overlap (users may be in multiple concurrent experiments) and there may be assignment interference. Include data schema for experiment logs, statistical adjustments (blocking, stratification, hierarchical models), and how you'd compute unbiased aggregates in the ETL layer.
MediumTechnical
0 practiced
Design an OLAP schema for a bookings fact table to support revenue attribution, promotions, taxes, and multi-currency reporting. Include suggested columns (raw amounts, breakdowns, converted_amount, currency_code, promo_id, fee_ids), data types, partitioning keys, and discuss when to normalize fees into separate dimension tables vs denormalizing into the fact.
MediumTechnical
0 practiced
How would you implement deterministic sampling to produce a reproducible 0.1% sample of events across multiple pipelines for debugging? Describe hashing strategy, choice of key(s) to hash (user_id, event_id), how to avoid bias, and how to change sample rates while preserving history.
HardTechnical
0 practiced
Define metrics and statistical tests to detect data drift in features used by a price prediction model (e.g., distribution shifts, missingness, encoding changes). Design an automated pipeline that monitors these signals, triggers alerts, runs model shadowing and retraining, and safely rolls back models if degradations are detected.
MediumTechnical
0 practiced
For a bookings fact table ingesting ~5TB/day of Parquet data, recommend partitioning and clustering strategies to speed queries commonly filtered by city, date, and host_id. Discuss partition key selection, file size targets, clustering/ordering to improve pruning, and maintenance tasks such as compaction and repartitioning.
Unlock Full Question Bank
Get access to hundreds of Airbnb-Specific Data Patterns interview questions and detailed answers.
Sign in to ContinueJoin thousands of developers preparing for their dream job.