SQL and coding questions in 1 hour using coderpad
Senior Data Engineer Interview Questions
2,552 senior data engineer interview questions shared by candidates
1. Explain about your projects 2. What and how will optimise spark jobs in your work(Spark related questions) 3. How will justify reliability and durability of data 4. What is Lineage graph and check points in Spark 5. Explain Strategic of Spark Coding 1. Input is an array of integers. There is a sliding window of size W which is moving from the very left of the array to the very right. Each time the sliding window moves rightwards by one position. Find the maximum number from each window. Input array is [1, 3, -1, -3, 5, 3, 6, 7] and Sliding window (W) is 3. Output array is [3, 3, 5, 5, 6, 7] 2. Write a query to provide number of times an employee got increment and max increment he has got along with columns emp_id, emp_name, joining_date, dept_name Additional info: a. An emp may not be tagged to any dept b. An emp may not have got any hike Emp table emp_id | emp_name | joining_date | Dept_id Dept table dept_id | dept_name | Dept_Location | Dept_Manager salary_increase table emp_id | increment_date | inc_amount 3. Write a Python program to find element in a list from current element to next index list is ["Mon","Tue","Wed","Thu","Fri","Sat","Sun"] If current element is “Wed” and next index is 2, result is “Fri” If current element is “Sat” and next index is 23 (cyclic), result is “Mon”
Algoritms and data structures for python and intermediate to advance SQL
Mi experience with cdc approach challenging
First question was on palindromic strings of 0s and 1s that I spent too much time on, The second question was "simple" sliding window question on memory allocation. It's funny that I have been in long meetings to discuss this kind of logic, but somehow there was an expectation that I would code this on the spot. Yeah, I failed the tech hazing here. The system design questions were really well-made. I honestly had a lot of fun answering those. Lots of things about availability, architecture, and databases.
A few generic sql questions which were easy. Also a python list problem with an O(n) constraint, about a leetcode medium.
Basic concept on architecture and coding question
How would you handle multiple syslog from many systems?;
2nd Round: Advanced SQL, subscription based problem and to explain data schema design and data modelling.
How do you know which Azure SQL Solution to use and when? Have you found any specific functionality that works in one solution and not in another?
Viewing 241 - 250 interview questions