What are window functions? Explain RANK and DENSE_RANK. Explain joins and the number of output rows each join type produces for a given table. WHERE vs. HAVING clause. Second-highest salary for each department. Partitioning vs. bucketing, and when to choose each. Python code to remove duplicates from a list. List vs. tuple vs. dict. Python code to find combinations from the list [2, 3, 5] whose sum is 8.
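One possible answer to the two Python items above, as a minimal sketch. The function names are made up for illustration. The question does not say whether an element of [2, 3, 5] may be used more than once, so both readings are shown; the reuse variant assumes all numbers are positive.

```python
from itertools import combinations


def dedupe(items):
    """Remove duplicates, keeping first-seen order (items must be hashable)."""
    return list(dict.fromkeys(items))


def subsets_with_sum(nums, target):
    """Reading 1: each element may be used at most once."""
    return [
        combo
        for size in range(1, len(nums) + 1)
        for combo in combinations(nums, size)
        if sum(combo) == target
    ]


def combination_sum(nums, target):
    """Reading 2 (assumed): an element may be reused; nums must be positive."""
    results = []

    def backtrack(start, remaining, path):
        if remaining == 0:
            results.append(list(path))
            return
        # Starting at `start` avoids emitting the same combination twice.
        for i in range(start, len(nums)):
            if nums[i] <= remaining:
                path.append(nums[i])
                backtrack(i, remaining - nums[i], path)
                path.pop()

    backtrack(0, target, [])
    return results


print(dedupe([1, 2, 2, 3, 1]))
print(subsets_with_sum([2, 3, 5], 8))
print(combination_sum([2, 3, 5], 8))
```

Without reuse, only (3, 5) sums to 8. With reuse, [2, 2, 2, 2] and [2, 3, 3] qualify as well, so it is worth asking the interviewer which reading is intended.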
Senior Data Engineer Interview Questions
2,560 senior data engineer interview questions shared by candidates
Questions about previous projects, with some cross-questioning.
SQL: window queries. Python: priority queues and min-heaps. SD: wallet payments.
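For the priority-queue and min-heap topic, a minimal sketch built on the standard-library `heapq` module. The `PriorityQueue` class and its method names are hypothetical, not something the interviewer specified. Lower priority numbers come out first, and an insertion counter breaks ties so that equal priorities pop in insertion order.

```python
import heapq


class PriorityQueue:
    """Min-heap backed priority queue (illustrative sketch)."""

    def __init__(self):
        self._heap = []
        self._count = 0  # tie-breaker: preserves insertion order

    def push(self, priority, item):
        # Tuples compare element by element, so the counter is only
        # consulted when two priorities are equal.
        heapq.heappush(self._heap, (priority, self._count, item))
        self._count += 1

    def pop(self):
        _priority, _order, item = heapq.heappop(self._heap)
        return item

    def __len__(self):
        return len(self._heap)


queue = PriorityQueue()
for priority, name in [(3, "c"), (1, "a"), (2, "b"), (1, "a2")]:
    queue.push(priority, name)

popped = [queue.pop() for _ in range(len(queue))]
print(popped)
```

Both `push` and `pop` run in O(log n), which is the usual reason to prefer a heap over re-sorting a list after every insert.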
Lots of introductory questions on how to build data pipelines. Nothing very sophisticated.
Draw the DAG for a particular Spark program's execution.
Asked questions on the software development process. Asked questions on software testing, especially unit testing. What challenges have you experienced in data transformation with Parquet data types?
Nothing particularly difficult. An open-ended design question followed by a coding question. I felt I could have given a better answer to the design problem, but compared to the rest of the interview that wasn't the worst part.
RDD vs. DataFrame for a 1 GB file and a 500 MB file: which would you use for each, and why?
What is the difference between Parquet and CSV, and why is Parquet preferred in Spark?
No dynamic programming-style nonsense. The interviews had been so reasonable that it was only when I got the offer that I realized how big the company was.