How do you handle skewness? Also a coding exercise in Spark, plus questions on partitionBy, repartition, shuffle and sort, and the Catalyst optimizer.
Big Data Engineer Interview Questions
1,228 big data engineer interview questions shared by candidates
SQL queries, list and array manipulations, dynamic programming
1. SQL: output of different joins.
About my background and follow up questions
Overlapping Intervals and Second appearance of every character in a String
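One possible reading of these two exercises, sketched in Python. The question gives no exact statement, so both interpretations are assumptions: "overlapping intervals" is taken as merging a list of (start, end) pairs, and "second appearance" as reporting the index at which each repeated character occurs for the second time. The function names are hypothetical.

```python
def merge_intervals(intervals):
    """Merge (start, end) pairs that overlap or touch; returns sorted tuples."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


def second_appearances(text):
    """Map each character that repeats to the index of its second occurrence."""
    counts, result = {}, {}
    for index, ch in enumerate(text):
        counts[ch] = counts.get(ch, 0) + 1
        if counts[ch] == 2:
            result[ch] = index
    return result
```

Sorting first keeps the merge a single pass, and the character scan is also a single pass with one dictionary lookup per character.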
Past projects with ETL. Theory questions about MapReduce, Hadoop, and Spark. Some of these questions were quite obscure, like: How does Spark process skewed data?
Mostly questions on the project and the concepts around it
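A common answer to the skew question is key salting: a hot key is split into several sub-keys so that no single task receives all of its records, and the partial results are combined afterwards. The sketch below illustrates the idea in plain Python rather than with the Spark API; the function name, the round-robin salt, and the sample data are all assumptions made for illustration.

```python
from collections import Counter


def salted_counts(keys, num_salts):
    """Two-stage count: aggregate per (key, salt), then per key."""
    # Stage 1: attach a round-robin salt so a hot key is spread
    # over num_salts groups instead of one.
    partial = Counter((key, i % num_salts) for i, key in enumerate(keys))
    # Stage 2: drop the salt and combine the partial counts.
    final = Counter()
    for (key, _salt), n in partial.items():
        final[key] += n
    return partial, final


# Hypothetical skewed input: one key dominates.
keys = ["hot"] * 1000 + ["a"] * 10 + ["b"] * 10
partial, final = salted_counts(keys, num_salts=8)
```

With this input the largest stage-1 group shrinks from 1000 records to 125, while the final counts match an unsalted count. In Spark the same pattern would typically be expressed by adding a salt column before the aggregation or join.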
Times for X{open, close} and Y{open, close} records:

rec_type, status, time
x1, open, 930
x1, close, 1030
x2, open, 1035
y1, open, 1040
y2, open, 1041
x2, close, 1100
x3, open, 1110
x3, close, 1115
y1, close, 1120
y2, close, 1121

Find the pairs of x-type and y-type records that have any time overlap between them.
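A minimal Python sketch of one way to approach this exercise, using the event log from the question. It assumes each record opens once and closes once, that x-type and y-type records are told apart by the first letter of rec_type, and that intervals which only touch at an endpoint do not count as overlapping. The function names are hypothetical.

```python
# Event log from the question: (rec_type, status, time as an HHMM integer).
events = [
    ("x1", "open", 930), ("x1", "close", 1030),
    ("x2", "open", 1035), ("y1", "open", 1040),
    ("y2", "open", 1041), ("x2", "close", 1100),
    ("x3", "open", 1110), ("x3", "close", 1115),
    ("y1", "close", 1120), ("y2", "close", 1121),
]


def build_intervals(events):
    """Collapse open/close events into {rec_type: (open_time, close_time)}."""
    opened, intervals = {}, {}
    for rec, status, time in events:
        if status == "open":
            opened[rec] = time
        else:
            intervals[rec] = (opened.pop(rec), time)
    return intervals


def overlapping_pairs(events):
    """Return (x, y) pairs whose open/close intervals overlap."""
    intervals = build_intervals(events)
    xs = sorted(k for k in intervals if k.startswith("x"))
    ys = sorted(k for k in intervals if k.startswith("y"))
    pairs = []
    for x in xs:
        x_open, x_close = intervals[x]
        for y in ys:
            y_open, y_close = intervals[y]
            # Two intervals overlap when each starts before the other ends.
            if x_open < y_close and y_open < x_close:
                pairs.append((x, y))
    return pairs
```

For the sample data, `overlapping_pairs(events)` returns `[("x2", "y1"), ("x2", "y2"), ("x3", "y1"), ("x3", "y2")]`. The nested loop is quadratic in the number of records; a sweep over the time-ordered events would scale better on large inputs.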
Tell me about your research/projects that are related to big data
What is the difference between SQL and NoSQL databases?