Data Research Engineer - Data Extraction

March 25

Apply Now
Logo of Forbes

Forbes

Forbes is a prominent global media company, known for its brand of content covering a wide range of topics such as business, investing, technology, entrepreneurship, leadership, and lifestyle. It features news, analysis, and information about issues of concern to industry leaders and affluent individuals. Forbes is widely recognized for its authoritative rankings of businesses and industry leaders, such as the Forbes Global 2000, and is a trusted source for in-depth analysis of current events and trends in the financial and corporate worlds. Through its digital and print platforms, Forbes engages its audience with high-quality journalism and thought leadership content.

Business • Finance • Investing • Technology • Politics

201 - 500 employees

Founded 1917

📱 Media

💸 Finance

👥 B2C

💰 $200M Corporate Round on 2022-02

📋 Description

• Develop methods and processes for data quality assurance (QA) to ensure accuracy, completeness, and integrity. • Define and implement data validation rules and automated data quality checks. • Perform data profiling and analysis to identify anomalies, outliers, and inconsistencies. • Assist in acquiring and integrating data from various sources, including web crawling and API integration. • Develop and maintain scripts in Python for data extraction, transformation, and loading (ETL) processes. • Stay updated with emerging technologies and industry trends. • Explore third-party technologies as alternatives to legacy approaches for efficient data pipelines. • Contribute to cross-functional teams in understanding data requirements. • Assume accountability for achieving development milestones. • Prioritize tasks to ensure timely delivery, in a fast-paced environment with rapidly changing priorities. • Collaborate with and assist fellow members of the Data Research Engineering Team as required. • Leverage online resources effectively like StackOverflow, ChatGPT, Bard, etc., while considering their capabilities and limitations.

🎯 Requirements

• Bachelor's degree in Computer Science, Data Science, or a related field. • Strong proficiency in Python programming for data extraction, transformation, and loading. • Proficiency in SQL and data querying is a plus. • Knowledge of Python modules such as Pandas, SQLAlchemy, gspread, PyDrive, BeautifulSoup and Selenium, sklearn, Plotly. • Knowledge of web crawling techniques and API integration. • Knowledge of data quality assurance methodologies and techniques. • Experience in AI/ML engineering and data extraction. • Experience with LLMs, NLP frameworks (spaCy, NLTK, Hugging Face, etc.) • Strong understanding of machine learning frameworks (TensorFlow, PyTorch). • Experience with RESTful API design and integration. • Design and build AI models using LLMs. • Integrate LLM solutions with existing systems via APIs. • Collaborate with the team to implement and optimize AI solutions. • Monitor and improve model performance and accuracy. • Familiarity with HTML, CSS, JavaScript. • Familiarity with Agile development methodologies is a plus. • Strong problem-solving and analytical skills with attention to detail. • Creative and critical thinking. • Ability to work collaboratively in a team environment. • Good and effective communication skills. • Experience with version control systems, such as Git, for collaborative development. • Ability to thrive in a fast-paced environment with rapidly changing priorities. • Comfortable with autonomy and ability to work independently.

🏖️ Benefits

• Day off on the 3rd Friday of every month (one long weekend each month) • Monthly Wellness Reimbursement Program to promote health well-being • Monthly Office Commutation Reimbursement Program • Paid paternity and maternity leaves

Apply Now

Discover 100,000+ Remote Jobs!

Join now to unlock all jobs

Discover hidden jobs

We scan the internet everyday and find jobs not posted on LinkedIn or other job boards.

Head start against the competition

We find jobs as soon as they're posted, so you can apply before everyone else.

Be the first to know

Daily emails with new job openings straight to your inbox.

Choose your membership

Loved by 10,000+ remote workers
🎉$6 / week

Cancel anytime

MOST POPULAR
🥳$18 / month
$24
Save 25% vs weekly

Cancel anytime

BEST VALUE
🥰$54 / year
$216
Save 75% vs monthly

Cancel anytime

Wall of Love

Frequently asked questions

We use powerful scraping tech to scan the internet for thousands of remote jobs daily. It operates 24/7 and costs us to operate, so we charge for access to keep the site running.

Of course! You can cancel your subscription at any time with no hidden fees or penalties. Once canceled, you’ll still have access until the end of your current billing period.

Other job boards only have jobs from companies that pay to post. This means that you miss out on jobs from companies that don't want to pay. On the other hand, Remote Rocketship scrapes the internet for jobs and doesn't accept payments from companies. This means we have thousands more jobs!

New jobs are constantly being posted. We check each company website every day to ensure we have the most up-to-date job listings.

Yes! We’re always looking to expand our listings and appreciate any suggestions from our community. Just send an email to Lior@remoterocketship.com. I read every request.

Remote Rocketship is a solo project by me, Lior Neu-ner. I built this website for my wife when she was looking for a job! She was having a hard time finding remote jobs, so I decided to build her a tool that would search the internet for her.

Why I created Remote Rocketship

Choose your membership

Loved by 10,000+ remote workers
🎉$6 / week

Cancel anytime

MOST POPULAR
🥳$18 / month
$24
Save 25% vs weekly

Cancel anytime

BEST VALUE
🥰$54 / year
$216
Save 75% vs monthly

Cancel anytime

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com