Site Reliability Engineer - Rust - Core Backend - Remote @ Kraken Digital Asset Exchange - Princeton, NJ

Job Overview

4 months ago

About Kraken

Our mission is to accelerate the adoption of cryptocurrency so that you and the rest of the world can achieve financial freedom and inclusion. In our first decade, Kraken has risen to become one of the largest, most successful and respected crypto exchanges on the planet.

We are changing the way the world thinks about finance, and our range of successful products is playing a critical role in the mainstream adoption of crypto assets. We continue to blaze a trail into new territory with the introduction of Kraken Bank, providing a more seamless integration between crypto and the traditional financial system. This makes us the first crypto company ever to be awarded a U.S. state banking charter.

Our diverse group of 2,000+ Krakenites is distributed all over the world as part of our 'remote first' culture, united by a shared passion for delighting customers, upholding crypto values and achieving our meaningful mission. We attract people who push themselves to improve, are radically transparent and think differently in order to unlock their potential.

Crypto is a rapidly evolving industry and we’re just getting started. We’re growing fast and you're invited to join the revolution!

About the Role

This is a fully remote role. We will consider applicants based in North America, South America, and EMEA.

Our Engineering team is having a blast while delivering the most sophisticated crypto-trading platform out there. Help us continue to define and lead the industry.

As part of Kraken's Core Backend team, you will work within a world-class team of engineers building Kraken's infrastructure using Rust. As a Site Reliability Engineer, you will keep one of the fastest-growing companies in the world up and available in a 24/7 environment. You will bring your own technical expertise to monitor and support staging and production environments; build tooling, CI/CD pipelines, and deployment specs; and generally automate internal processes to empower developers and improve team efficiency.

Responsibilities

  • Monitor and support Staging and Production environments
  • Improve Developer Tooling, help with building Docker images, manage our Continuous Integration (CI) pipelines for automating quality testing, track key metrics, and generate reports
  • Collaborate with Dev, QA, and Product teams, and jump in to support and improve the development and release cycle
  • Automate syncing between services to improve the backend developer and QA workflows
  • Develop tools and bots to improve and automate internal processes
  • Support a fully distributed team operating across numerous timezones

Requirements

  • 5+ years in a DevOps role (DevOps, SRE, etc.)
  • 1-3+ years of experience with a programming language (Rust and/or Golang)
  • Proficient in Git version control
  • Thorough knowledge of Docker and extensive experience with Kubernetes, experience with Terraform and Helm Charts
  • Passion for improving process and products
  • Experience configuring Continuous Integration (CI)
  • Ability to thrive while working independently and remotely in a team-based environment
  • Self-starter, ability to context-switch between various projects, codebases and concepts
  • Ability to independently debug problems involving the network and operating system
  • Well-versed in scripting languages and in building and administering Linux systems
  • Interest in security and a thoughtful and thorough consideration of the security implications of development decisions

Nice to haves

  • Passion for open-source and contributing back to the community
  • Experience with Cloud infrastructure
  • Experience benchmarking applications and identifying bottlenecks
  • Experience with Slack, Jira, Google, and/or GitLab APIs
  • Experience with monitoring / alerting (primarily with Prometheus / Grafana) and knowledge of best practices in the area
  • Experience with distributed systems and technologies (gRPC, Kafka, NoSQL, SQL, Redis, ...)

Location Tagging: #US #EU #LI-LM1

We’re powered by people from around the world with their own unique and diverse experiences. We value all Krakenites and their talents, contributions, and perspectives, regardless of their background.

As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind, whether that’s based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.

Job Type: Full-time

Work Location: One location
