Duties The ENS automation team is looking for a software developer to assist in the automation of data governance and data auditing work. o Work alongside network tooling team to build auditing rules to improve network systems' data quality.
o Pull data from various systems utilizing MySQL, Hive, Presto, and Thrift
o Schedule automated pipeline jobs for data extraction and storage using Python.
o Develop the auto audit system that executes data audit periodically
o Troubleshoot, maintain, and optimize queries and pipelines for maximum performance and reliability.
o Design and build data visualizations in Tableau to measure impact of data quality improvement efforts.
o Perform documentation tasks as necessary
o Python development
o Must be comfortable with fast-paced multi-system/tool environment which is quickly changing
o Must independently familiarize themselves with the diverse data eco-system across Facebook
o Experienced in developing and landing Python code while exercising best practices
o Experienced in building and maintaining python based data pipelines
o Be able to gather data from multiple sources and compile it into an easy to understand format and structure
o Ability to run and troubleshoot SQL queries to facilitate data analysis and storage
o Ability to develop, maintain, and troubleshoot Tableau Dashboards
o Ability to work efficiently with cross function teams to develop new reporting based on in-depth understanding of customer needs and requirements
No Degree required.
Your candidate must submit these pre-screen questions:
Q: You have recently completed a dashboard and soon you will be presenting it for feedback. What questions do you ask yourself before presenting? Q: Users are reporting that one of your team's supported dashboards is reporting incorrect metrics/figures vs what they see in the tool itself. The end to end flow of data that supports this dashboard is as follows: Tool's MySQL DB → Hive Table → Tableau. At a high level and keeping it brief, what steps would you take to address this inconsistency?
Q: Give a few examples illustrating the difference between Bayesian vs. Non-Bayesian statistics without using a coin flipping analogy. You are not expected to know Bayesian statistics and you should go read up before answering.
Q: Given two input a = [3, 4, 5] and b = [9, 4], add them together in a pair-wise way and output c = [4, 3, 9]
Q: You're asked by your manager to deliver a complex project on an unreasonably short deadline. How do you respond to the request?