What metrics do you use to evaluate if the data you prepared to train a machine learning model ?
Senior Data Engineer Interview Questions
2,607 senior data engineer interview questions shared by candidates
Past experience on model deployment
Core concepts from mechanical domain
Initial Interview: Cultural and general tech questions Tech interview: General Python technical questions CTO interview: Motivation, experience and professional interests questions
What feedback did your manager give you
A developer on the team was running the following line in a function for logging purposes, and the job crashed with an "out of memory" exception. The developer says that the cluster has many workers with a lot of memory and disk and still the job crashes. Can you help explain how come to this line makes the job crash with OOM even though the cluster is huge? def someFunc(): for row in df.collect(): print(f'Customerr{row["customer"]} => Paid {row["price"]}')
Describe a project you are particularly proud of
sql queries and pyspark questions and on your projects
SQL queries medium to complex Spark concepts and scenario based questions
persist vs cache? what does init do? args vs kargs? json vs parquet? columnar storage? distributed systems? spark dataframe vs pandas dataframe
Viewing 1881 - 1890 interview questions