ML Data Engineer II

January 15

🌵 Arizona – Remote

🏄 California – Remote

+18 more states

💵 $126k - $196k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

🚰 Data Engineer

🦅 H1B Visa Sponsor

Scribd

Scribd is a digital document library offering access to over 195 million documents across various topics and niches. It serves as a platform for users to discover, upload, and share a diverse array of content, including academic papers, legal documents, manuals, and more. Through a subscription, users can enjoy an ad-free experience and have the ability to download documents for offline use. Scribd also includes access to Everand, which provides millions of ebooks, audiobooks, podcasts, magazines, and sheet music. Additionally, SlideShare within Scribd allows access to millions of community-uploaded presentations and professional documents. With its comprehensive library, Scribd caters to those seeking specialized knowledge and inspiration in nearly every field.

ebooks • publishing • subscription service • books • literary community

201 - 500 employees

Founded 2007

🛍️ eCommerce

📱 Media

☁️ SaaS

💰 $58M Private Equity Round on 2019-11

📋 Description

• At Scribd (pronounced “scribbed”), our mission is to spark human curiosity.
• Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare.
• We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.
• Our flexible work benefit - Scribd Flex - enables employees, in partnership with their manager, to choose the daily work-style that best suits their individual needs.
• As an organization, we prioritize collaboration and intentional in-person moments to build culture and connection.
• For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.
• The ML Data Engineering team is at the heart of metadata extraction and enrichment for all of our brands, managing and processing hundreds of millions of documents, billions of images, and serving millions of users.
• We operate at an unparalleled scale, handling diverse datasets, including UGC documents, ebooks, audiobooks, and more.
• Our goal is to build robust systems that drive content discovery, trust, and structured metadata across our platforms.
• We are seeking a Software Engineer II with a strong background in data engineering, software development, and scalable systems.
• As part of the ML Data Engineering team, you will work on designing, building, and optimizing systems that extract, enrich, and process metadata at scale.
• You’ll collaborate closely with machine learning teams, product managers, and other engineers to ensure the smooth integration and processing of vast amounts of structured metadata.
• Our team uses a range of technologies: Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch), Datadog, and Terraform.

🎯 Requirements

• 3+ years of experience as a professional software engineer.
• Proficient in one or more programming languages, such as Python, Ruby, Scala, or similar.
• Hands-on experience with data processing frameworks like Apache Spark, Databricks, or similar tools for large-scale data processing.
• Experience working with systems at scale.
• Experience working with a public cloud provider (AWS, Azure, or Google Cloud).
• Hands-on experience with building, deploying, and optimizing solutions using ECS, EKS or AWS Lambdas.
• Proven ability to test and optimize systems for performance and scalability.
• Bachelor’s in CS or equivalent professional experience.
• Bonus points if you have experience working with Machine Learning systems.

🏖️ Benefits

• Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
• 12 weeks paid parental leave
• Short-term/long-term disability plans
• 401k/RSP matching
• Tuition Reimbursement
• Learning & Development programs
• Quarterly stipend for Wellness, Connectivity & Comfort
• Mental Health support & resources
• Free subscription to Scribd + gift memberships for friends & family
• Referral Bonuses
• Book Benefit
• Sabbaticals
• Company wide events
• Team engagement budgets
• Vacation & Personal Days
• Paid Holidays (+ winter break)
• Flexible Sick Time
• Volunteer Day
• Company-wide Diversity, Equity, & Inclusion programs
