Software Engineer - Data Engineering and Machine Learning

August 26

Apply Now

Description

β€’ Design, construct, and maintain data pipelines to combine large volumes of geospatial, climate + weather, and electric utility datasets β€’ Work with a cross-functional team to deliver data in support of analytic and ML pipelines β€’ Develop deep familiarity with electric utility datasets and take ownership of integration of new datasets into our existing environments β€’ Contribute to ML model development in the context of understanding future extreme weather impacts on the power grid β€’ Optimize storage and ETL pipelines β€’ Develop versioned, scalable, repeatable and reliable pipelines for utility data that is in GIS and Tabular format to Delta Lake format β€’ Scale & automate data pipelines for statistical analysis for internal and external use-cases β€’ Standardize and scale multi-tenant data storage β€’ Exceptional ability to diagnose data issues and discrepancies β€’ Ability to modularize different stages of data ingestion and verification β€’ Ability to write algorithms for data sanity checks and classification of different data elements β€’ Ability to develop heuristics and suggestions for missing data items β€’ Ability to validate and test pipelines and write functional test to validate the pipelines

Requirements

β€’ Exceptional Python programming skills β€’ Exceptional programming skills with NumPy, SciPy, Xarrays β€’ Exceptional programming skills with frameworks like Dagster or Airflow or Prefect β€’ Exceptional programming skills with Databricks or Apache Spark or Amazon EMR or Cloudera β€’ Deep expertise in storage optimization and partitioning on RDS, Postgres, PostGIS, Delta Lake β€’ Hands on with GIS dataset and QGIS or ESRI β€’ Hands on experience with multi-dimensional Climate or Weather data β€’ Familiarity or hands on experience with Secure Cloud Development β€’ Exposure or experience with electric utility tech stacks (AMS, OMS, GIS, etc.) β€’ Exposure to applied ML and Data Engineering in the context of electric utilities

Benefits

β€’ Unlimited time off β€’ Stock options β€’ Excellent health, dental, and vision β€’ 401k

Apply Now

Similar Jobs

February 16

Datavant

201 - 500

Data Engineer for a healthcare data logistics company specializing in ETL processes.

Built byΒ Lior Neu-ner. I'd love to hear your feedback β€” Get in touch via DM or lior@remoterocketship.com