Two technical rounds, followed by an HR round.
The 1st round is PySpark coding.
The 2nd round is based on your project.
The last round is an HR discussion.
The interviewer mainly focuses on Spark DataFrame code and SQL.
Interview questions [1]
Question 1
Split a file into words and count the frequency of each word in PySpark
Department-wise highest salary in PySpark
General Introduction.
Discussed my previous tech stack.
Questions around data volume I’ve worked with, nature of my work, and the impact it had.
Coding Problems:
Q1. Longest Substring with Unique Characters
Q2. Check if Two Strings are Anagrams (O(n), no sorting allowed)
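One possible way to answer both coding problems, sketched in Python. The function names are hypothetical. The first uses a sliding window over the string; the second compares character counts, which avoids sorting and runs in O(n).

```python
from collections import Counter


def longest_unique_substring(s: str) -> str:
    """Return the first longest substring of s with no repeated characters."""
    last_seen = {}          # character -> index of its most recent occurrence
    start = 0               # left edge of the current window
    best_start, best_len = 0, 0
    for i, ch in enumerate(s):
        # If ch already occurs inside the window, move the left edge past it.
        if ch in last_seen and last_seen[ch] >= start:
            start = last_seen[ch] + 1
        last_seen[ch] = i
        if i - start + 1 > best_len:
            best_start, best_len = start, i - start + 1
    return s[best_start:best_start + best_len]


def are_anagrams(a: str, b: str) -> bool:
    """Check whether a and b use the same characters the same number of times."""
    if len(a) != len(b):
        return False
    # Counting is linear in the input length; no sorting involved.
    return Counter(a) == Counter(b)
```

This sketch treats the comparison as case-sensitive and counts spaces as characters; whether to normalise the input first is worth clarifying with the interviewer.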
SQL Question: From a product pricing table, find product names where the prices are strictly increasing over months.
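A sketch of one way to write this query, run here through Python's built-in `sqlite3` so it can be tried locally. The table name `product_prices` and its columns (`product_name`, `month`, `price`) are assumptions, since the original schema was not given; it also assumes an SQLite build with window function support (3.25 or later).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product_prices (product_name TEXT, month TEXT, price REAL)")
conn.executemany(
    "INSERT INTO product_prices VALUES (?, ?, ?)",
    [
        ("pen", "2024-01", 10), ("pen", "2024-02", 12), ("pen", "2024-03", 15),
        ("ink", "2024-01", 5),  ("ink", "2024-02", 5),  ("ink", "2024-03", 7),
        ("cap", "2024-01", 9),  ("cap", "2024-02", 8),  ("cap", "2024-03", 11),
    ],
)

QUERY = """
WITH steps AS (
    SELECT product_name,
           price,
           -- Previous month's price for the same product (NULL for the first month).
           LAG(price) OVER (PARTITION BY product_name ORDER BY month) AS prev_price
    FROM product_prices
)
SELECT product_name
FROM steps
GROUP BY product_name
-- Keep products with no month where the price stayed flat or dropped.
HAVING SUM(CASE WHEN prev_price IS NOT NULL AND price <= prev_price THEN 1 ELSE 0 END) = 0
ORDER BY product_name
"""

increasing = [row[0] for row in conn.execute(QUERY)]
```

With the sample rows above, only `pen` qualifies: `ink` has a flat month and `cap` has a drop. Note that a product with a single month of data passes trivially under this query, which may or may not match what the interviewer wants.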
Interview questions [1]
Question 1
Q1. Basic Spark, Python, SQL Questions.
Q2. Some Data Modelling Questions.
The hiring manager reached out to me directly and gave me a take-home assessment: a web IP related question. Just one question, no time limit. The next step is to talk with other coworkers.
Interview questions [1]
Question 1
Introduce yourself and describe a difficult project you've been working on
Without even an introduction, three back-to-back DSA questions: anagram detection, linked list addition, and one on tree traversal.
Questions on how master and worker nodes work and on creating data pipelines.
Interview questions [1]
Question 1
Anagram detection, addition of linked list node values
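The linked list question is not spelled out in detail, so the sketch below assumes the common variant: two non-negative numbers are stored as linked lists of digits, least significant digit first, and the task is to return their sum in the same form. The `Node` class and helper names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    val: int
    next: Optional["Node"] = None


def from_digits(digits: List[int]) -> Optional[Node]:
    """Build a linked list from a list of digits (helper for trying the sketch)."""
    head = None
    for d in reversed(digits):
        head = Node(d, head)
    return head


def to_digits(node: Optional[Node]) -> List[int]:
    """Flatten a linked list back into a Python list."""
    out = []
    while node is not None:
        out.append(node.val)
        node = node.next
    return out


def add_lists(a: Optional[Node], b: Optional[Node]) -> Optional[Node]:
    """Add two numbers stored least-significant-digit first."""
    dummy = tail = Node(0)   # dummy head simplifies appending
    carry = 0
    while a is not None or b is not None or carry:
        total = carry + (a.val if a else 0) + (b.val if b else 0)
        carry, digit = divmod(total, 10)
        tail.next = Node(digit)
        tail = tail.next
        a = a.next if a else None
        b = b.next if b else None
    return dummy.next
```

For example, `to_digits(add_lists(from_digits([2, 4, 3]), from_digits([5, 6, 4])))` represents 342 + 465. If the interviewer instead means summing all node values in a single list, a plain traversal with a running total is enough.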