Recruit Panda
Senior Data Engineer - Python/Scala/Azure
Job Location
mohali, India
Job Description
About the role : We are seeking a Senior Data Engineer skilled in Databricks, Python, Scala, Azure Synapse and Azure Data Factory to join our team of data engineers within Greystar Information Technology. This team serves Greystar by ingesting data from multiple sources, making it available to internal stakeholders, and by interfacing with and exchanging data between a variety of internal and external systems. You will be responsible for building and enhancing our Enterprise Data Platform (EDP) which is built within the Azure cloud and utilizes modern processes and technologies such as Databricks, Synapse, Azure Data Factory (ADF), ADLS Gen2 Data Lake, Azure DevOps and CI/CD pipelines. You will develop, deploy and troubleshoot complex data ingestion pipelines and processes. Your curious mind and attention to detail will be an asset, as will your extensive knowledge and experience in the data engineering space. JOB DESCRIPTION : How you will make in impact : - Design, develop, optimize, and maintain data architecture and pipelines that adhere to ETL principles and business goals - Collaborate with data engineers, data consumers, and other team members to come up with simple, functional, and elegant solutions that balance the data needs across the organization - Solve complex data problems to deliver insights that helps the organization achieve its goals - Create data products that will be used throughout the organization - Advise, consult, mentor and coach other data and analytic professionals on data standards and practices - Foster a culture of sharing, re-use, design for scale stability, and operational efficiency of data and analytic solutions - Develop and deliver documentation on data engineering capabilities, standards, and processes; participate in coaching, mentoring, design reviews and code reviews - Partner with business analysts and solutions architects to develop technical architectures for strategic enterprise projects and initiatives. - Deliver awesome code Technical Qualifications : - 9-12 years relevant and progressive data engineering experience - Deep Technical knowledge and experience in Databricks, Python, Scala, Microsoft Azure architecture and platform including Synapse, ADF (Azure Data Factory) pipelines and Synapse stored procedures - Hands-on experience working with data pipelines using a variety of source and target locations (e.g., Databricks, Synapse, SQL Server, Data Lake, file-based, SQL and No-SQL database) - Experience in engineering practices such as development, code refactoring, and leveraging design patterns, CI/CD, and building highly scalable data applications and processes - Experience developing batch ETL pipelines; real-time pipelines are a plus - Knowledge of advanced data engineering concepts such as dimensional modeling, ETL, data governance, data warehousing involving structured and unstructured data - Thorough knowledge of Synapse and SQL Server including T-SQL and stored procedures - Experience working with and supporting cross-functional teams in a dynamic environment - A successful history of manipulating, processing and extracting value from large disconnected datasets. - Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. - Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases. - Knowledge and understanding of Boomi is a plus Additional Qualifications and Experience : - Excellent problem-solving skills and experience - Effective communication skills - Strong collaboration skills - "Self-starter" attitude and the ability to make decisions with minimal guidance from others - Innovative and passionate about your work and the work of your teammates - Ability to comprehend and analyze operational systems and ask appropriate questions to determine how to improve, migrate or modify the solution to meet business needs - Experience with data ingestion and engineering, specifically involving large data volumes - Knowledge of CI/CD release pipelines is a plus - Understanding of Python and knowledge of parallel processing frameworks like MapReduce, Spark, Scala - Knowledge of the Agile development process (ref:hirist.tech)
Location: mohali, IN
Posted Date: 11/24/2024
Location: mohali, IN
Posted Date: 11/24/2024
Contact Information
Contact | Human Resources Recruit Panda |
---|