Spark, AWS, Python, Data Engineering
Sr Data Engineer Interview Questions
2,595 sr data engineer interview questions shared by candidates
1) Rank over partition 2) SQL joins and filtering
what type of join do you use to join a big dataset with a small dataset in spark?
How to handle scaling up and down in Kafka? how to handle decreasing the number of partitions in each topic?
Explain the spark jobs that you have written in your project and why?
Name the different materializations in dbt along with when each should be used.
1.joins output 2.spark internal 3.copy activity in adf on premise 4.scd, data modeling 5. Project architecture
medium difficulty question from Coderbyte. The question was very easy just by looking at it you will build the logic instantly, but I couldn't convert the time string into appropriate numbers, which led to code not working and later got rejected. I approached it correctly and explained my logic very well however i didn't remember to convert the time string into an appropriate numerical format which resulted in code not working correctly.
A system design interview around a real-time leaderboard system
. . . . .
Viewing 441 - 450 interview questions