Craft demo mostly demonstrating spark working knowledge, distributed systems.
Sr Data Engineer Interview Questions
2,606 sr data engineer interview questions shared by candidates
Sort 2 already sorted arrays into one array
Design LRU cache
What are the different types of slowly changing dimensions?
Implement scalable topK words for Amazon product descriptions using count-min sketch.
Questions are mostly on optimization and product sense
About project design and implementation
A: They asked me to walk through a real-world Spark use case I’d implemented—building a streaming ETL for clickstream data—detailing how I managed stateful windowed aggregations, tuned `spark.sql.shuffle.partitions`, and fixed data skew with a custom partitioner.
What are your experiences in data engineering?
"They asked me to design an end-to-end data pipeline that supports both batch and real-time processing using tools like Kafka, Spark, and a cloud platform like AWS or Azure. They were particularly interested in how I’d handle schema evolution and ensure data quality at each stage."
Viewing 2051 - 2060 interview questions