I applied through college or university. I interviewed at Anonymous Content (Bengaluru) in Aug 2025.
Interview
Round 1 - Online Assessment based on CN, OS, DBMS, and 2 DSA questions
Round 2 - Technical Round
C++, Java, OOPS. They asked OOPS concepts in detail: Polymorphism, Abstraction, Inheritance, Encapsulation,
different types of inheritance, and DSA questions.
It was easy, but do prepare advanced topics.
What are accumulators and broadcast variables? Use cases?
What is Tungsten and Catalyst Optimizer in Spark?
Difference between cache and persist.
What is checkpointing? When do you use it?
Interview questions [1]
Question 1
Find duplicate rows in a PySpark DataFrame.
Remove duplicates but keep the latest row (based on timestamp).
Find employees who logged in for 3 consecutive days.
Pivot sales data: rows (month, sales) → columns (Jan, Feb, Mar…).
Explode JSON column (with arrays) into multiple rows.
Read data from Kafka using PySpark Structured Streaming.
Write a PySpark job that loads data incrementally each day using partition pruning.