Big Data Internship Interview Questions

1,784 big data internship interview questions shared by candidates

SQL Assessment Questions

The data model below shows relationships between tables. You will need to understand these tables to develop SQL queries that answer the questions. All columns except Primary Key (PK) columns should be assumed to be nullable. Sample data for each of the tables (Sales, Product, SalesReturnReason) is shown below. A complete set of data is included in the attached Excel Spreadsheet. This spreadsheet data may be imported into your database of choice.

Question 1
Write a SQL query that will return the total amount of sales by product category for the month of December 2013.
• Your query should return 2 columns: ProductCategory, SalesAmount.
• Your query should produce a 2-row result set, including the following row and 1 additional row that is not shown:

Question 2
Write a SQL query that returns the total amount of returns for reasons that were our fault (OurFault = “Y”), regardless of whether the items were physically returned (Returned = “Y”).
• Your query should return one column: Sales.
• Your query should produce a 1-row result set looking like the following:

Question 3
Write a SQL query that will show total sales for the top 4 product subcategories, including ties (see note below), based on 2013 sales.
• A NULL ProductSubCategory should be listed as “Unknown”.
• In the event of a tie within the top 4, both records should be included.
• Your query should return 3 columns: ProductSubCategory, Sales, and SalesRank.
• Your query should produce a 4-row result set including the following 2 records and 2 additional rows that are not shown:

Question 4
Write a SQL query that will show all transactions where a customer purchased a Hydration Pack (ProductSubCategory = " Hydration Packs") and their previous purchase -- based on TransactionTimestamp -- was a Water Bottle or Cage (ProductSubCategory = "Bottles and Cages").
• Your query should return 6 columns: TransactionID, CustomerName, ProductSubCategory, PriorProductSubCategory, TransactionTimestamp, and SalesAmount.
• Your query should produce a 3-row result set that includes these 2 rows and 1 additional row that is not shown:

Question 5
Write a SQL query that will return total sales by product category and year.
• Write the SQL query without using a CASE statement.
• A NULL ProductCategory should be listed as “Unknown”.
• Your query should return 4 columns: ProductCategory, 2012, 2013, and 2014. Each of the year columns will show the total sales amount for that year.
• Your query should produce a 4-row result set including the 2 rows below, and 2 additional rows that are not shown:

Big Data Architect

Interviewed at Centric Consulting

3.9
Jun 22, 2018
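
The data model and spreadsheet from this assessment are not reproduced here, so the following is only a minimal sketch of how Questions 1, 3, and 4 might be approached, written in Python with the standard sqlite3 module. The join key ProductID, the placement of CustomerName in the Sales table, the timestamp format, and every sample row are assumptions made for illustration, not the assessment's real schema or data. The window functions RANK and LAG need SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical schema and rows: ProductID as join key, CustomerName stored in
# Sales, and all values below are assumptions, not the assessment's real data.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Product (
    ProductID INTEGER PRIMARY KEY,
    ProductCategory TEXT,
    ProductSubCategory TEXT
);
CREATE TABLE Sales (
    TransactionID INTEGER PRIMARY KEY,
    CustomerName TEXT,
    ProductID INTEGER,
    TransactionTimestamp TEXT,
    SalesAmount REAL
);
INSERT INTO Product VALUES
    (1, 'Bikes', 'Road Bikes'),
    (2, 'Accessories', 'Hydration Packs'),
    (3, 'Accessories', 'Bottles and Cages'),
    (4, 'Accessories', NULL);
INSERT INTO Sales VALUES
    (1, 'Ann', 3, '2013-11-02 09:00:00', 10.0),
    (2, 'Ann', 2, '2013-12-05 10:00:00', 55.0),
    (3, 'Bob', 1, '2013-12-20 09:30:00', 1000.0),
    (4, 'Bob', 2, '2013-12-21 11:00:00', 55.0),
    (5, 'Cid', 4, '2013-12-31 23:59:59', 15.0),
    (6, 'Cid', 1, '2014-01-01 00:00:00', 900.0);
""")

# Question 1: December 2013 sales by category, using a half-open date range
# so that timestamps late on December 31 are still included.
december_sales = conn.execute("""
    SELECT p.ProductCategory, SUM(s.SalesAmount) AS SalesAmount
    FROM Sales s
    LEFT JOIN Product p ON p.ProductID = s.ProductID
    WHERE s.TransactionTimestamp >= '2013-12-01'
      AND s.TransactionTimestamp <  '2014-01-01'
    GROUP BY p.ProductCategory
    ORDER BY p.ProductCategory
""").fetchall()

# Question 3: aggregate first, then rank. RANK() gives tied rows the same
# rank, so a tie inside the top 4 keeps both records.
top_subcategories = conn.execute("""
    WITH totals AS (
        SELECT COALESCE(p.ProductSubCategory, 'Unknown') AS ProductSubCategory,
               SUM(s.SalesAmount) AS Sales
        FROM Sales s
        LEFT JOIN Product p ON p.ProductID = s.ProductID
        WHERE s.TransactionTimestamp >= '2013-01-01'
          AND s.TransactionTimestamp <  '2014-01-01'
        GROUP BY COALESCE(p.ProductSubCategory, 'Unknown')
    ),
    ranked AS (
        SELECT ProductSubCategory, Sales,
               RANK() OVER (ORDER BY Sales DESC) AS SalesRank
        FROM totals
    )
    SELECT ProductSubCategory, Sales, SalesRank
    FROM ranked
    WHERE SalesRank <= 4
    ORDER BY SalesRank, ProductSubCategory
""").fetchall()

# Question 4: LAG() looks up each customer's previous purchase by timestamp.
hydration_after_bottles = conn.execute("""
    WITH history AS (
        SELECT s.TransactionID, s.CustomerName, p.ProductSubCategory,
               LAG(p.ProductSubCategory) OVER (
                   PARTITION BY s.CustomerName
                   ORDER BY s.TransactionTimestamp
               ) AS PriorProductSubCategory,
               s.TransactionTimestamp, s.SalesAmount
        FROM Sales s
        LEFT JOIN Product p ON p.ProductID = s.ProductID
    )
    SELECT * FROM history
    WHERE ProductSubCategory = 'Hydration Packs'
      AND PriorProductSubCategory = 'Bottles and Cages'
""").fetchall()

print(december_sales)
print(top_subcategories)
print(hydration_after_bottles)
```

LEFT JOIN is used because every non-key column is stated to be nullable, so a sale without a matching product should still count. Whether "top 4, including ties" calls for RANK or DENSE_RANK is a judgment call; the two differ only when a tie occurs above fourth place.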


1 maths (simple probability) question, 1 Unix question (really simple stuff, such as how to remove the first column of a data set in a text file in bash), 1 "design a program in a language of your choice" question with some simple tasks like printing out the first Fibonacci numbers over 1000, and finally a question about SQL queries. On the whole, the test was very easy. It did not give you the opportunity to do anything smart. You either knew the answers, or you didn't. There were no questions that required any thought. One question was literally "what is the combination of keys to close emacs or vim?". The test in no way attempted to gauge your ability to think logically or learn new concepts quickly.
Apr 24, 2015
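
For readers preparing for a test like the one described above, here is a minimal Python sketch of two of the mentioned tasks: listing the first Fibonacci numbers above 1000, and dropping the first column of delimited text. The review does not say how many numbers were requested or which delimiter the file used, so the `count` parameter, the tab default, and both function names are assumptions.

```python
from typing import Iterable, Iterator, List


def first_fibonacci_over(threshold: int, count: int) -> List[int]:
    """Return the first `count` Fibonacci numbers strictly greater than `threshold`."""
    found: List[int] = []
    a, b = 0, 1
    while len(found) < count:
        if a > threshold:
            found.append(a)
        a, b = b, a + b
    return found


def drop_first_column(lines: Iterable[str], delimiter: str = "\t") -> Iterator[str]:
    """Yield each line with everything up to the first delimiter removed.

    A line that contains no delimiter has only one column, so it becomes empty.
    """
    for line in lines:
        _, _, rest = line.rstrip("\n").partition(delimiter)
        yield rest


print(first_fibonacci_over(1000, 3))
print(list(drop_first_column(["id\tname\tcity\n", "1\tAnn\tOslo\n"])))
```

In a shell, `cut -f2- file.txt` does much the same for tab-delimited input, except that cut prints lines without any delimiter unchanged unless `-s` is given.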

