What is BigQuery. What is Partitioning and Bucketing What is Dataflow with example
Sr Data Engineer Interview Questions
2,595 sr data engineer interview questions shared by candidates
2 rounds of technical interview plus 1 hr discussion revolving around projects,past experiences, questions on skills and technology
1> Technical questions related to job role 2> Current project work flows 3> Some programming and Infra related questions 4> English test including speaking, writing and others
Explain Hbase minor and major compaction
Given a list of domains, that are entered sorted, we want to minimize the output of urls by uniting sub-domains by only outputting their common ancestor. Also sibling domains should be listed as their common ancestor.
Data Engineering questions: Explain data solution Toughest tasks you had a a a data engineer
The HR interview was just a relaxed chat based around values and general compatibility for the role. They ensured that you have done your reading around the company and role. The tech challenge was python based, using a provided API to gather/filter/resubmit data. You had to upload the solution along with appropriate tests to Git. The engineer interview didn’t cover the tech challenge solution at all (surprisingly). It was a general discussion on past experience and projects. Later it switched gears to some more general questions like ‘what is etl?’ or ‘tell us about data streaming products?’ or ‘what problems have you had using semi-structured or unstructured data?’. AWS experience and services used was also covered. The whiteboard session I found the hardest part and was again scenario based where you had to diagnose problems in a process and work with the other engineers to come up with a solution. Later (once the main problem had been covered) more requirements/problems were added to see how you thought on your feed and you were questioned on which products you would use (and why)
SQL Query Writing: **Table A** Columns: col1 col2 **Table B** Columns: col3 col4 There is no common column to implement the JOIN condition and the Required Output is: col1, col2, col3, col4
SQL Query Writing: **Table EMP** Input Columns: EMPID, DEPID, SAL, DOJ Output Columns: EMPID, DEPID, SAL, DOJ, NEXT_SAL NEXT_SAL can be derived as the salary of immediate next employee in the given records.
Which AWS services you have used (Hands-On)? Tell me one use case and one limitation of AWS Lambda function?
Viewing 741 - 750 interview questions