Interview asked me to explain my previous projects?
Senior Data Engineer Interview Questions
2,591 senior data engineer interview questions shared by candidates
I was asked to talk about my leadership style.
1. Explain about your projects 2. What and how will optimise spark jobs in your work(Spark related questions) 3. How will justify reliability and durability of data 4. What is Lineage graph and check points in Spark 5. Explain Strategic of Spark Coding 1. Input is an array of integers. There is a sliding window of size W which is moving from the very left of the array to the very right. Each time the sliding window moves rightwards by one position. Find the maximum number from each window. Input array is [1, 3, -1, -3, 5, 3, 6, 7] and Sliding window (W) is 3. Output array is [3, 3, 5, 5, 6, 7] 2. Write a query to provide number of times an employee got increment and max increment he has got along with columns emp_id, emp_name, joining_date, dept_name Additional info: a. An emp may not be tagged to any dept b. An emp may not have got any hike Emp table emp_id | emp_name | joining_date | Dept_id Dept table dept_id | dept_name | Dept_Location | Dept_Manager salary_increase table emp_id | increment_date | inc_amount 3. Write a Python program to find element in a list from current element to next index list is ["Mon","Tue","Wed","Thu","Fri","Sat","Sun"] If current element is “Wed” and next index is 2, result is “Fri” If current element is “Sat” and next index is 23 (cyclic), result is “Mon”
Mi experience with cdc approach challenging
Algoritms and data structures for python and intermediate to advance SQL
Basic concept on architecture and coding question
First question was on palindromic strings of 0s and 1s that I spent too much time on, The second question was "simple" sliding window question on memory allocation. It's funny that I have been in long meetings to discuss this kind of logic, but somehow there was an expectation that I would code this on the spot. Yeah, I failed the tech hazing here. The system design questions were really well-made. I honestly had a lot of fun answering those. Lots of things about availability, architecture, and databases.
A few generic sql questions which were easy. Also a python list problem with an O(n) constraint, about a leetcode medium.
Come filtreresti dati in formato sbagliato?
I was asked about end-to-end Data Engineering project flow and my contribution at each stage with detailed technical discussion. Questions would revolve around the Big Data stack.
Viewing 241 - 250 interview questions