SaaS Enterprise Solution • Integrated Digital Workspace • Better R&D, Faster • Big Data Aggregator • Digital Research Platform
11 - 50
2 days ago
• Build, test, and operate automated Extract, Transform, and Load (ETL) pipelines that process terabytes of text data nightly.
• Develop service frontends around our various backend data stores (AWS Aurora, MySQL, Elasticsearch, S3).
• Rapidly prototype, test, and deploy data pipelines for LLMs using AWS.
• Collaborate with data scientists and NLP engineers to understand data requirements for LLMs.
• Optimize performance, reliability, and scalability of data pipelines and LLMs by applying best practices.
• Ensure quality, integrity, and security of the data by implementing data validation and governance policies.
• Bachelor's degree or higher in computer science, engineering, or a related field.
• 3+ years of experience in data engineering, preferably with large-scale text data and LLMs, and 6+ years of software engineering experience overall (including data engineering).
• Proficient in Python 3 or Java, preferably both.
• Experience with data modeling, ETL, and data warehouse design and implementation.
• Expertise with ETL schedulers such as Airflow, Prefect, or similar frameworks.
• Familiar with LLM and NLP concepts and frameworks such as Transformers, BERT, GPT, PaLM, and LLaMA.
• Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, and SNS.
• Experience extracting, processing, storing, and querying petabyte-scale datasets.
• Familiarity with building and using containers.
• Familiarity with event-based microservices.
• Strong communication, collaboration, and problem-solving skills.
2 days ago
2 - 10
Data Architect/Engineer designing mobile applications for Meetsta's social platform.
2 days ago
201 - 500
Help grow data systems at O'Reilly Media using Python; support millions of users.
🇺🇸 United States – Remote
💵 $110k - $138k / year
⏰ Full Time
🟡 Mid-level
🟠 Senior
🚰 Data Engineer
2 days ago
51 - 200
Support healthcare system as a Mid-Level Data Engineer at Diverge Health.
2 days ago
201 - 500
Data Engineer responsible for customer data conversion on enterprise software.
2 days ago
10,000+
Mid-level Analytical Engineer for Aboitiz Data Innovation's Data Platform team.