You have to check whether customer A has any records in a huge table (say, 100+ billion records). How would you do this in Spark without affecting cluster performance?
Senior Data Engineer Interview Questions
2,608 senior data engineer interview questions shared by candidates
Problem solving using Python arrays, dictionaries
1) Spark questions 2) Scala and Python questions 3) AWS questions 4) Coding round
What are some technologies you're familiar with?
- Questions on SQL, data structures and normal ETL in PySpark
- Spark optimization techniques
- Resume discussion
- How to handle data quality in big data
- Apache Airflow infrastructure
- Discussion on role-based policies
General questions about data streaming and event-driven architecture, when to use tabular vs. columnar storage, etc.
There was a technical homework question to design an ETL process that loads CSV files into a SQL system. The files' structures could change, so they had to be handled dynamically.
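A minimal sketch of how such a homework task might be approached, assuming SQLite as the target SQL system and treating every column as text. The function names `load_csv` and `quote_ident` and the table name `orders` are hypothetical. The idea is to read each file's header, create the table if it is missing, and add any columns the table does not have yet before inserting.

```python
import csv
import io
import sqlite3


def quote_ident(name: str) -> str:
    # Double-quote an identifier for SQLite, escaping embedded quotes.
    return '"' + name.replace('"', '""') + '"'


def load_csv(conn: sqlite3.Connection, table: str, csv_file) -> int:
    """Load one CSV into `table`, adding any columns the table lacks."""
    reader = csv.DictReader(csv_file)
    headers = reader.fieldnames or []
    if not headers:
        return 0  # empty file: nothing to load
    tbl = quote_ident(table)
    cols_sql = ", ".join(f"{quote_ident(h)} TEXT" for h in headers)
    conn.execute(f"CREATE TABLE IF NOT EXISTS {tbl} ({cols_sql})")
    # PRAGMA table_info yields one row per column; index 1 is the name.
    existing = {row[1] for row in conn.execute(f"PRAGMA table_info({tbl})")}
    for h in headers:
        if h not in existing:
            # Schema drift: the file has a column the table has not seen.
            conn.execute(f"ALTER TABLE {tbl} ADD COLUMN {quote_ident(h)} TEXT")
    col_list = ", ".join(quote_ident(h) for h in headers)
    placeholders = ", ".join("?" for _ in headers)
    rows = [[row.get(h) for h in headers] for row in reader]
    conn.executemany(
        f"INSERT INTO {tbl} ({col_list}) VALUES ({placeholders})", rows
    )
    conn.commit()
    return len(rows)


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    load_csv(conn, "orders", io.StringIO("id,amount\n1,9.5\n"))
    # The second file carries an extra column, which is added on the fly.
    load_csv(conn, "orders", io.StringIO("id,amount,currency\n2,7.25,EUR\n"))
    print(conn.execute('SELECT * FROM "orders"').fetchall())
```

Rows loaded before a column existed simply hold NULL for it. A fuller design would also need type inference and handling for dropped or renamed columns, which this sketch leaves out.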
Live coding was mainly in SQL, with some Python questions about ingesting data from an API. Some lower-level engineers were fairly clearly interviewing for the first time, so they went from full resume review to live coding and back without transitions, as if to fill the time allotted to them.
How many years of Python experience do you have?
Tell me about a time you had a challenge in a project.