Databricks Data Engineer
Talascend is currently seeking a Databricks Data Engineer for a remote, contract opportunity with our client.
Overview
The role involves developing and supporting new and existing data pipelines and data analytics environments in an Azure cloud-based data lake. The data engineer will translate business requirements into data engineering solutions to support an enterprise-scale, Microsoft Azure-based data analytics platform.
Shift
Remote, with quarterly travel for 2 days to Gaithersburg, MD or Indianapolis, IN. Standard work hours.
Clearance
Ability to obtain a Public Trust Security Clearance
US Citizen
Responsibilities
- Design, build, and optimize scalable data solutions using Databricks and Medallion Architecture.
- Manage ingestion routines for processing multi-terabyte datasets efficiently for multiple projects simultaneously.
- Integrate data from various structured and unstructured sources to enable high-quality business insights.
- Implement effective data management strategies to ensure data integrity, availability, and accessibility.
- Identify opportunities for cost optimization in data storage, processing, and analytics operations.
- Monitor and support user requests, addressing platform or performance issues, cluster stability, Spark optimization, and configuration management.
- Collaborate with the team to enable advanced AI-driven analytics and data science workflows.
- Integrate with various Azure services including Azure Functions, Storage Services, Data Factory, Log Analytics, and User Management for seamless data workflows.
- Provision and manage infrastructure using Infrastructure-as-Code (IaC).
- Apply best practices for data security, data governance, and compliance, ensuring support for federal regulations and public trust standards.
- Proactively collaborate with technical and non-technical teams to gather requirements and translate business needs into data solutions.
Qualifications
- US Citizenship
- BS degree in Computer Science or related field with 3+ years of experience, or
- Master’s degree with 2+ years of experience
- 3+ years of experience designing and developing ingestion flows using cloud platform services, with attention to data quality
- Databricks Data Engineer certification and 2+ years of experience maintaining the Databricks platform and developing in Spark
- Ability to work directly with clients and act as front-line support for incoming client requests.
- Ability to clearly document and express solutions in the form of architecture and interface diagrams.
- Proficiency in Python, Spark, and R is essential.
- .NET based development is a plus.
- Knowledge and experience with data governance, including metadata management, enterprise data catalog, design standards, data quality governance, and data security.
- Experience with Agile process methodology, CI/CD automation, and cloud-based development (Azure, AWS).
Preferred Qualifications
- Certifications in Azure cloud.
- Knowledge of FinOps principles and cost management.
We thank all applicants for their interest. However, only those qualified individuals who closely meet the qualifications of the position will be contacted. The details of the position are only a summary; other duties may be assigned as necessary.
A background check and drug screen may be required.
Pay range is not a guarantee of compensation or salary, as final offer amount may vary based on factors including but not limited to experience and geographic location. Talascend also offers a variety of benefits including: health and disability insurance, 401(k), EAP, paid time off, and company-paid holidays. The specific programs and options available to an employee may vary depending on date of hire, plan requirements, schedule type, and client work site mandates.
Talascend is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.
