November 12
• Design, build, and manage scalable data pipelines and ETL processes in Databricks
• Integrate data from various systems, ensuring quality and accuracy
• Develop and maintain the data lake and warehouse environments
• Collaborate with data governance teams to ensure data privacy and security
• Work with data scientists and business stakeholders to develop solutions
• Identify performance bottlenecks and optimize Databricks workflows
• Create and maintain clear documentation for data processes

• Strong experience in SQL, PySpark, CI/CD, CLI, Azure DevOps, Unity Catalog, performance optimization
• Experience with parsing and loading semi-structured data (JSON)
• Experience with parsing and loading unstructured data (clinical notes, DICOM images)
• Ability to set up streaming data in the Azure environment
• Prior healthcare experience

• Health
• Vision
• Dental
• 401K plan
• Life Insurance
• Pretax Commuter Benefits
• Incredibly supportive team