“What is data shuffling and how does it impact performance?” “What’s data skew?” “How would you resolve it?” “What is Kubernetes?”
Senior Data Engineer Interview Questions
2,613 senior data engineer interview questions shared by candidates
Describe your project in detail. Python, PySpark, AWS cloud.
Second-highest salary, broadcast join, Catalyst optimizer, salting
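Salting comes up in these questions as a fix for data skew. A minimal sketch of the idea in plain Python (not Spark), assuming a hypothetical dataset with one dominant key and a hypothetical salt count of 4: appending a salt to the key splits one oversized key group into several smaller ones, so no single task has to process all of the hot key's records.

```python
from collections import Counter

NUM_SALTS = 4  # hypothetical number of salt values

# Hypothetical skewed input: the key "hot" dominates the dataset.
records = [("hot", i) for i in range(1000)] + [("cold", i) for i in range(10)]

# Without salting, all 1000 "hot" records belong to one key group,
# so they would all land in the same shuffle partition.
unsalted_groups = Counter(key for key, _ in records)

# With salting, the key is extended with a salt value (here derived from
# the record index so the result is deterministic). The hot key becomes
# NUM_SALTS distinct keys that can be processed in parallel.
salted_groups = Counter(f"{key}_{i % NUM_SALTS}" for key, i in records)

largest_unsalted = max(unsalted_groups.values())
largest_salted = max(salted_groups.values())

print("largest key group without salt:", largest_unsalted)
print("largest key group with salt:", largest_salted)
```

In a salted join, the other side of the join has to be replicated once per salt value so that every salted key still finds its match; for an aggregation, the partial results per salted key are combined in a second step.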
Very broad system design questions about a system that tracks and reports website traffic.
Based on PySpark DataFrame manipulations (withColumn, conditions, explode, etc.). Be interactive.
What are some things you would like to change within your team?
1. Spark cluster configuration to process a 1 TB file 2. Questions on data modelling 3. GCP-related questions 4. How do you implement logging? 5. SMB join in Hive. Along with these, a few scenario-based questions and some architecture-related questions on cloud and Kubernetes.
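One common way to approach the 1 TB sizing question is back-of-the-envelope arithmetic. The sketch below is only an illustration under stated assumptions: a 128 MB input partition size (the default of Spark's `spark.sql.files.maxPartitionBytes`), 5 cores per executor, and 4 task waves per core. The last two numbers are assumed rules of thumb, not values from the question.

```python
import math

input_size_mb = 1 * 1024 * 1024  # 1 TB expressed in MB
partition_size_mb = 128          # assumption: ~128 MB per input partition
cores_per_executor = 5           # assumption: rule-of-thumb executor size
waves = 4                        # assumption: each core runs 4 tasks in sequence

# One task per input partition.
num_partitions = math.ceil(input_size_mb / partition_size_mb)

# Cores needed so that all tasks finish in the assumed number of waves.
total_cores = math.ceil(num_partitions / waves)

# Executors needed to provide that many cores.
num_executors = math.ceil(total_cores / cores_per_executor)

print("partitions:", num_partitions)
print("total cores:", total_cores)
print("executors:", num_executors)
```

Changing any of the assumed values changes the answer considerably, which is usually the point of the question: the interviewer wants to hear the reasoning, not a fixed number.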
Basics of Spark coding and its architecture
SQL and Python scripting questions.
Data models, schema, SQL, cloud