Data Interview Questions

133,162 data interview questions shared by candidates

Spark:
1. Difference between map and flatMap.
2. Difference between groupByKey and reduceByKey.
3. Which file format does Spark save files in? The answer was supposed to be ".orc" files only. (It took me multiple attempts to understand the question, and ".txt", ".csv", "xml", or "any flat-file format" was not the correct answer. LOL)
4. Difference between coalesce and repartition.
5. Some more questions on RDDs.

Hive:
1. Difference between ORDER BY and SORT BY.
2. Select * from . How many mappers will be created?
3. Analytical functions / What is RANK?

HDFS:
1. Difference between block and split.

SQL:
1. What is indexing on a single column? (I was stopped in the middle of my answer.) This was followed by: what is the sort order of the index key, ascending or descending? (How does it matter when it is a primary key, not a composite key?)
2. Can you have two primary keys? (I do not think there would be a need for two primary keys, as one by itself does the job.) "No, tell me about the implementation: is it possible, can you define two primary keys?" (No.)

One of the panel members got irritated and fired off a few more questions after saying bye and after answering my questions about the profile and position. (Literally: Panel member one: "Okay, bye, thanks for you ..." Panel: "No, I will ask one last question. Which file format does Spark save files in?")

Big Data Consultant

Interviewed at Prowareness

3.8
Mar 3, 2017
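
For the "two primary keys" question above, a minimal sketch may help. It uses Python's built-in sqlite3 module as a stand-in database (an assumption; the interview did not name an engine, and other engines word the error differently). The table name t, the columns a and b, and the helper try_create are hypothetical. It contrasts declaring two separate primary keys with declaring one composite primary key over two columns.

```python
import sqlite3


def try_create(ddl: str) -> str:
    """Run one CREATE TABLE statement in a throwaway in-memory database.

    Returns "ok" on success, or "error: <message>" if the engine rejects it.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(ddl)
        return "ok"
    except sqlite3.OperationalError as exc:
        return f"error: {exc}"
    finally:
        conn.close()


# Two separate PRIMARY KEY declarations on one table (hypothetical table t).
two_primary_keys = try_create(
    "CREATE TABLE t (a INTEGER PRIMARY KEY, b INTEGER PRIMARY KEY)"
)

# One PRIMARY KEY constraint that spans two columns (a composite key).
composite_key = try_create(
    "CREATE TABLE t (a INTEGER, b INTEGER, PRIMARY KEY (a, b))"
)

print("two primary keys:", two_primary_keys)
print("composite key:   ", composite_key)
```

In this sketch the first statement is rejected and the second succeeds, which matches the "(No)" answer: a table has a single primary key, though that key may cover more than one column.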


In the interview they asked:
1. Write test cases for a ceiling fan.
2. What is structured data and what is unstructured data?
3. Tell me about yourself.
4. They gave me a test case about a cab, with a table of cab number, driver name, trip start time, trip end time, and driver duty start and end times, and asked me to draw an analysis from this information.
5. The final round was with the MD over Skype. He also asked many questions based on my CV and some test cases, including how you ensure quality data from a vendor and how to check whether data has been updated.

Trainee Data Analyst

Interviewed at Obsessory

2.6
Apr 3, 2017
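
One possible way to approach the cab test case above is sketched here in Python. Everything in it is an assumption made for illustration: the sample rows, the HH:MM time format, the two chosen metrics (share of duty time spent on trips, and trips falling outside duty hours), and the function names. The interview did not specify what analysis was expected.

```python
from datetime import datetime

# Hypothetical sample rows:
# (cab_no, driver_name, trip_start, trip_end, duty_start, duty_end)
ROWS = [
    ("CAB-1", "Asha", "09:30", "10:15", "09:00", "17:00"),
    ("CAB-1", "Asha", "11:00", "12:30", "09:00", "17:00"),
    ("CAB-2", "Ravi", "08:45", "09:15", "09:00", "13:00"),
]


def minutes(start: str, end: str) -> int:
    """Minutes between two same-day HH:MM timestamps."""
    fmt = "%H:%M"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds() // 60)


def utilization(rows):
    """Fraction of each driver's duty window that was spent on trips."""
    trip_minutes = {}
    duty_minutes = {}
    for _cab, driver, t_start, t_end, d_start, d_end in rows:
        trip_minutes[driver] = trip_minutes.get(driver, 0) + minutes(t_start, t_end)
        duty_minutes[driver] = minutes(d_start, d_end)
    return {d: trip_minutes[d] / duty_minutes[d] for d in trip_minutes}


def trips_outside_duty(rows):
    """Trips that start before or end after the driver's duty window.

    Zero-padded HH:MM strings compare correctly as plain strings.
    """
    return [
        (cab, driver, t_start)
        for cab, driver, t_start, t_end, d_start, d_end in rows
        if t_start < d_start or t_end > d_end
    ]


print(utilization(ROWS))
print(trips_outside_duty(ROWS))
```

The second function doubles as a data-quality check, which ties in with the MD's question about ensuring quality data from a vendor.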


