July 10
Ansible
Apache
AWS
Chef
Cloud
Distributed Systems
DNS
EC2
Grafana
Kafka
Kubernetes
Postgres
Pulsar
Puppet
SDLC
Terraform
Web3
Go
• Aurora Labs is the development company behind Aurora—the EVM blockchain that runs on the NEAR Protocol. • We are also the developers of, and integration partner behind, Aurora Cloud—a suite of products that allow Web2 companies to capture the value of Web3. • This role is split between two responsibilities: software engineering (80%) and site reliability (20%). • Software Engineering projects include: • Shield - a security service to protect users from making errors or executing malicious transactions. • Mempool - a system to store/reorder transactions before they can hit the blockchain. • Relayer - translates RPC calls on the read and write path from the end user. • Explorer - Blockscout-based system that provides a user interface. • Aurora Cloud - a system to automatically provision multiple infrastructure stacks for Aurora Engine. • CLI tools for pubsub and streaming infrastructure operations. • Indexers and blockchain event aggregation pipelines for monitoring purposes. • Reliability Engineering includes: • Automating configuration and maintenance of software components such as K8s, NATS, Influxdb, Postgres, Cloudflare using e.g. Ansible, Terraform, Helm and kubernetes operators. • Design and implementation of cloud-agnostic solutions without exclusively relying on specific cloud vendors. • Optimizing the latency and throughput of the pub-sub infrastructure. • Incident management, troubleshooting, monitoring, distributed tracing and recovery automation. • You are a software engineer with experience of creating and maintaining backend systems.
• Experience with Devops or SRE as an engineering subject area, with proficiency in Golang. • Successful track-record and proven experience as a backend internet services software developer. • Knowledge of SDLC, including continuous integration and testing methodologies. • Understanding of base internet infrastructure services including DNS, HTTP, server virtualization, server monitoring in critical, large scale distributed systems. • Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil by code and process improvements. • Excellent verbal and written communication skills in English • Deep familiarity with Go or other system-oriented programming languages. • Experience with development within Kubernetes ecosystem, including operator framework, controllers and CRDs. • Experience with streaming and pubsub systems such as NATS, Apache Kafka, Apache Pulsar. • Automating operations processes via services and tools. • Configuration management and fleet orchestration via Puppet, Chef, Ansible, or others. • Cloud Services (AWS S3/EC2/CloudFront or equivalent).
Apply Now