Reference Architectures for Big Data Systems

Data Platform at Uber
Data Platform at BMW
Data Platform at Netflix
Data Platform at Flipkart
Data Platform at Coupang
Data Platform at DoorDash
Data Platform at Khan Academy
Data Infrastructure at Airbnb
Data Infrastructure at LinkedIn
Data Infrastructure at GO-JEK
Data Ingestion Infrastructure at Pinterest
Data Analytics Architecture at Pinterest
Big Data Processing (2 parts) at Spotify
Big Data Processing at Uber
Analytics Pipeline at Lyft
Analytics Pipeline at Grammarly
Analytics Pipeline at Teads
ML Data Pipelines for Real-Time Fraud Prevention at PayPal
Big Data Analytics and ML Techniques at LinkedIn
Self-Serve Reporting Platform on Hadoop at LinkedIn
Privacy-Preserving Analytics and Reporting at LinkedIn
Analytics Platform for Tracking Item Availability at Walmart
HALO: Hardware Analytics and Lifecycle Optimization at Facebook
RBEA: Real-time Analytics Platform at King
AresDB: GPU-Powered Real-time Analytics Engine at Uber
AthenaX: Streaming Analytics Platform at Uber
Delta: Data Synchronization and Enrichment Platform at Netflix
Keystone: Real-time Stream Processing Platform at Netflix
Databook: Turning Big Data into Knowledge with Metadata at Uber
Amundsen: Data Discovery & Metadata Engine at Lyft
Maze: Funnel Visualization Platform at Uber
Metacat: Making Big Data Discoverable and Meaningful at Netflix
SpinalTap: Change Data Capture System at Airbnb
Accelerator: Fast Data Processing Framework at eBay
Omid: Transaction Processing Platform at Yahoo
TensorFlowOnSpark: Distributed Deep Learning on Big Data Clusters at Yahoo
CaffeOnSpark: Distributed Deep Learning on Big Data Clusters at Yahoo
Spark on Scala: Analytics Reference Architecture at Adobe
Experimentation Platform (2 parts) at Spotify
Experimentation Platform at Airbnb
Smart Product Platform at Zalando
Log Analysis Platform at LINE
Data Visualisation Platform at Myntra
Building and Scaling Data Lineage at Netflix
Building a scalable data management system for computer vision tasks at Pinterest
Structured Data at Etsy
Scaling a Mature Data Pipeline - Managing Overhead at Airbnb
Spark Partitioning Strategies at Airbnb
Scaling the Hadoop Distributed File System at LinkedIn
Scaling Hadoop YARN cluster beyond 10,000 nodes at LinkedIn