SaaS Enterprise Solution β’ Integrated Digital Workspace β’ Better R&D, Faster β’ Big Data Aggregator β’ Digital Research Platform
October 20
SaaS Enterprise Solution β’ Integrated Digital Workspace β’ Better R&D, Faster β’ Big Data Aggregator β’ Digital Research Platform
β’ Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly. β’ Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3). β’ Rapidly prototype, test, and deploy data pipelines for LLMs using AWS. β’ Collaborate with data scientists and NLP engineers to understand data requirements for LLMs. β’ Optimize performance, reliability, and scalability of data pipelines and LLMs by applying best practices. β’ Ensure quality, integrity, and security of the data by implementing data validation and governance policies.
β’ Bachelor's degree or higher in computer science, engineering, or a related field. β’ 3+ years of experience in data engineering, preferably with large-scale text data and LLMs and 6+ years of any software engineering experience (including data engineering). β’ Proficient in Python 3 or Java, preferably both. β’ Experience with data modeling, ETL, and data warehouse design and implementation. β’ Expertise with ETL schedulers such as Airflow, Prefect or similar frameworks. β’ Familiar with LLMs and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA. β’ Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, & SNS. β’ Experience extracting, processing, storing, and querying of petabyte-scale datasets. β’ Familiarity with building and using containers. β’ Familiarity with event-based microservices. β’ Strong communication, collaboration, and problem-solving skills.
Apply NowOctober 20
Data Architect/Engineer designing mobile applications for Meetsta's social platform.
October 19
Data Engineer at Career.io to shape data infrastructure and drive decisions.
October 17
Data Engineer to design and optimize data systems for analytics and machine learning.
October 17
Design, develop, and optimize data systems for advanced analytics and AI-driven insights.
October 17
Data Engineer for CDC Foundation's public health data infrastructure development.
πΊπΈ United States β Remote
π΅ $103.5k - $143.5k / year
β° Full Time
π‘ Mid-level
π Senior
π° Data Engineer
π¦ H1B Visa Sponsor