Site Reliability Engineer

January 25

Aurora Labs

Aurora Labs is a development company that specializes in creating blockchain technologies. It is the developer behind Aurora, an Ethereum Virtual Machine (EVM) blockchain built on the NEAR Protocol, and also offers Aurora Cloud, a suite of products aimed at helping Web2 companies integrate Web3 technologies. Aurora Labs focuses on creating an ecosystem that supports blockchain growth and adoption, and is rapidly expanding its team to include various engineering and marketing roles.

blockchain • EVM • NEAR Protocol • Ethereum • Layer2

51 - 200 employees

🌐 Web 3

Description

• Aurora is a network of Virtual Chains that combines NEAR’s scalability with powerful infrastructure for the easy deployment of preconfigured blockchains.
• We invite you to be a part of our team of smart, professional, result-oriented and fun individuals.
• Join us to help ensure that our background processes run smoothly while we strive to become the best in the industry.
• Our infrastructure team is responsible for building and supporting the critical systems required for running and accessing the NEAR and Aurora networks.
• Load balancing, caching, queueing, transaction simulation and block production are handled by services written and maintained by the infrastructure team.
• These services operate at large scale and process terabytes of data.
• The platform is based on open-source software, such as Kubernetes, NATS, JetStream, Blockscout, Grafana, Postgres and Near-core, alongside a few internally developed services.
• This role is split between two responsibilities: site reliability (80%) and software engineering (20%).
• Reliability engineering includes ensuring high availability and failure tolerance of our infrastructure, and automating the configuration and maintenance of software components.
• Design and implementation of cloud-agnostic solutions that do not rely exclusively on a specific cloud vendor.
• Incident management, monitoring, distributed tracing and recovery automation.
• Software engineering projects include sidecars that implement cloud-agnostic infrastructure abstractions for developers, and CLI tools for pubsub and streaming infrastructure operations.
• You are a reliability engineer with experience creating and maintaining backend systems.
• You are familiar with the entire Linux stack and can easily find a bottleneck in a distributed system.
• You have developed CLI tools and backend services before and are comfortable applying your software development skills to automate your daily operations or to create a microservice on the request path of the end user.

Requirements

• Strong emphasis on SRE as an engineering subject area, with proficiency in Golang.
• Successful track record and proven experience as a backend internet services software developer.
• Knowledge of SDLC, including continuous integration and testing methodologies.
• Understanding of base internet infrastructure services, including DNS, HTTP, server virtualization and server monitoring in critical, large-scale distributed systems.
• Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil through code and process improvements.
• Excellent verbal and written communication skills in English.
• Experience with development within the Kubernetes ecosystem, including the operator framework, controllers and CRDs.
• Experience with streaming and pubsub systems such as NATS, Apache Kafka and Apache Pulsar.
• Hardware bootstrap and associated security.
• Structured or unstructured storage and caching.
• Automating operations processes via services and tools.
• Configuration management and fleet orchestration via Puppet, Chef, Ansible or others.
• Cloud services (AWS S3/EC2/CloudFront or equivalent).
