Data Engineering Interview Questions

56,373 data engineering interview questions shared by candidates

You have a single branch of a GIT or SVN repository. At some commit number G, the code in the repo worked (G = good commit). At a later commit number B, you noticed that the code was broken (B = bad). The code may have become broken anytime at or before B, but after G. B was just the first time that you noticed the code was broken. Both G and B are positive integers and G < B. You have access to a function.......bool isCommitGood(integer)......... that takes a commit number and tells you whether the code at that commit worked or was broken. Write code to find the most recent working commit (i.e., the number/index of the last good commit before the code became broken).
avatar

Software Engineering New Grad

Interviewed at Meta

3.6
Oct 27, 2015

You have a single branch of a GIT or SVN repository. At some commit number G, the code in the repo worked (G = good commit). At a later commit number B, you noticed that the code was broken (B = bad). The code may have become broken anytime at or before B, but after G. B was just the first time that you noticed the code was broken. Both G and B are positive integers and G < B. You have access to a function.......bool isCommitGood(integer)......... that takes a commit number and tells you whether the code at that commit worked or was broken. Write code to find the most recent working commit (i.e., the number/index of the last good commit before the code became broken).

Data Model: LinkedIn data model - model for 1st degree connection Python: 1. Dedup items in the list - retaining the order of items. ==> cannot use dict/set since order will not be retained. Follow up question on this - How would you handle nested lists? (they are looking for recursion) 2. Find the number of words in a sentence /avg length of word SQL On the product-sales-customers data model that is preloaded in coderpad.io, write the following queries 1. Count of stores in OR state with area_sqft > 25000 2. avg number of Female Customers group by state 3. Customer FirstName, Last Name and count of unqiue products purchased by state followup question - Return the top customer by state based on diverse product purchased (diverse = count(distinct product_id))
avatar

Senior Data Engineering Manager

Interviewed at Meta

3.6
Aug 26, 2020

Data Model: LinkedIn data model - model for 1st degree connection Python: 1. Dedup items in the list - retaining the order of items. ==> cannot use dict/set since order will not be retained. Follow up question on this - How would you handle nested lists? (they are looking for recursion) 2. Find the number of words in a sentence /avg length of word SQL On the product-sales-customers data model that is preloaded in coderpad.io, write the following queries 1. Count of stores in OR state with area_sqft > 25000 2. avg number of Female Customers group by state 3. Customer FirstName, Last Name and count of unqiue products purchased by state followup question - Return the top customer by state based on diverse product purchased (diverse = count(distinct product_id))

Viewing 731 - 740 interview questions

Glassdoor has 56,373 interview questions and reports from Data engineering interviews. Prepare for your interview. Get hired. Love your job.