Lead Engineer Interview Questions

7,930 lead engineer interview questions shared by candidates

1st Round How to run an adf trigger on the last working day of the month? Why not using databricks warehouse? Pyspark coding level optimization Pyspark withcolumn and when-otherwise syntax. Will I need to Unpersist with job cluster?SQL - Top third highest salary Have you led a team? How large, who did the requirement gathering? Agile methodology 2nd Round How did u connect to SAP from ADF? - Open hub destination? Partitioning, Photon, liquid clustering, Z ordering. Will Z ordering work on a column if it is multiline(others are string)? Databricks workflow for which use case? Unity Catalogue -> How to migrate existing workspace to it? How to read Data from tables located in some other region? Given two large fact tables identify if they are exactly the same. Ans. indexing - hash and compare. Given two large fact tables with join, how to optimise for performance? Real time use case - how ingesting and orchestrating for million of records. Authorisation for the API. Write strategies for large tables - Size of your table/data. Different join strategies. Un nesting and reading of JSON. Details and e.g. Explode,Split,etc Struct type , Struct Field Setup and Run Python outside databricks environment. SQL based on windows functions with cumulative sum. Another on self join (Leetcode medium) 3rd Round - Liquid clustering, z-order, partition - Write strategies for large table - Size of your table/data - Data Mesh - Setup and Run Python outside databrciks environment - Tier of AAS and ADB? How to optimize? - How did u connect to SAP from ADF? - Open hub destination? - Unity Catalogue -> How to migrate existing workspace to it?, How to read Data from table located in some other region? - RLS - Trained or used AWS/Snowflake/GCP?
avatar

Lead Data Engineer

Interviewed at Diggibyte Technologies

4.2
Feb 12, 2025

1st Round How to run an adf trigger on the last working day of the month? Why not using databricks warehouse? Pyspark coding level optimization Pyspark withcolumn and when-otherwise syntax. Will I need to Unpersist with job cluster?SQL - Top third highest salary Have you led a team? How large, who did the requirement gathering? Agile methodology 2nd Round How did u connect to SAP from ADF? - Open hub destination? Partitioning, Photon, liquid clustering, Z ordering. Will Z ordering work on a column if it is multiline(others are string)? Databricks workflow for which use case? Unity Catalogue -> How to migrate existing workspace to it? How to read Data from tables located in some other region? Given two large fact tables identify if they are exactly the same. Ans. indexing - hash and compare. Given two large fact tables with join, how to optimise for performance? Real time use case - how ingesting and orchestrating for million of records. Authorisation for the API. Write strategies for large tables - Size of your table/data. Different join strategies. Un nesting and reading of JSON. Details and e.g. Explode,Split,etc Struct type , Struct Field Setup and Run Python outside databricks environment. SQL based on windows functions with cumulative sum. Another on self join (Leetcode medium) 3rd Round - Liquid clustering, z-order, partition - Write strategies for large table - Size of your table/data - Data Mesh - Setup and Run Python outside databrciks environment - Tier of AAS and ADB? How to optimize? - How did u connect to SAP from ADF? - Open hub destination? - Unity Catalogue -> How to migrate existing workspace to it?, How to read Data from table located in some other region? - RLS - Trained or used AWS/Snowflake/GCP?

Viewing 7581 - 7590 interview questions

Glassdoor has 7,930 interview questions and reports from Lead engineer interviews. Prepare for your interview. Get hired. Love your job.