Senior Site Reliability Engineer - Public Cloud @ JPMorgan Chase Bank, N.A. - Jersey City, NJ

Job Overview

8 days ago

PUBLIC CLOUD ENABLEMENT - PLATFORM SRE

Organization Description

Our CCB-IPM (Infrastructure Production Management) division is fueled by innovators like you who are driven to create technology solutions that make us work more efficiently, help our businesses grow, and maintain the stability and reliability of our applications and platforms. The Public Cloud Enablement Team defines and implements the standards and best practices that Product teams should follow, ensuring adequate controls when migrating to public cloud while maintaining parity across the board and gaining speed, efficiency, and quality.

Job Description

As an experienced Public Cloud Platform SRE professional, you will be an integral part of the Public Cloud Enablement Team, making decisions on Public Cloud strategy, on-boarding, and migration that impact our customers, clients, and businesses around the globe. Your expertise in public cloud migrations of complex systems, anticipating problems, and finding ways to mitigate risk will be key in leading numerous public cloud initiatives. Some of the key pillars you would be driving hands-on are platform modernization, technology life cycle management, security, performance testing, resiliency, and automation across Public Cloud Platforms including AWS, GCP, and Azure. In addition, you will:
  • Collaborate with product and engineering teams to deliver robust cloud-based solutions that drive enhanced customer experiences
  • Own end-to-end platform issues, help provide solutions to platform build and performance issues on the AWS Cloud, and ensure the delivered applications are bug free.
  • Strategize and guide various product teams on the standards and best practices related to the Public Cloud On-boarding process and help them migrate to public cloud while meeting all regulatory/compliance requirements.
  • Develop, enhance, and maintain established standards and best practices, including the Public Cloud PTx process.

And while you'll be part of a tight-knit team that shares your passion for modern technology, you'll also gain access to the best minds in the business, both as part of the JPMorgan Chase & Co. global technology community and through our partnerships with some of the most important technology firms in the world.

Role/Responsibilities
  • Help shape and deliver on a strategy to build broad use of Amazon's utility computing web services (e.g., AWS EC2, AWS S3, AWS RDS, AWS CloudFront, AWS EFS, CloudWatch)
  • Design resilient, secure, and high performing platforms in Public Cloud using JPMC best practices
  • Improve reliability, quality, and time-to-market of our suite of software solutions moving to public cloud
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for the public cloud platform
  • Debug and optimize systems and automate routine tasks.
  • Collaborate with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.
  • Drive work streams to ensure Applications meet strict non-functional requirements for Public Cloud On-boarding
  • Drive Cost management through the effective design and optimization of public cloud platforms and technologies.
  • Organize and run game days, resiliency tests and chaos engineering exercises.
  • Utilize programming languages like Java, Python, SQL, Node, Go, and Scala, Open Source RDBMS and NoSQL databases, Container Orchestration services including Docker and Kubernetes, and a variety of AWS tools and services
  • Monitor metrics and program health, anticipate and clear blockers, manage escalations
  • Roll your sleeves up in deep problem solving
  • Lead and mentor other technical resources on the team

Required Qualifications
  • Advanced understanding of business technology drivers and their impact on architecture design, performance and monitoring, and best practices
  • 10-12 years of experience across the SDLC process (design, development, and/or support)
  • 2-4 years of experience designing and building web environments on AWS, which includes working with services like EC2, ELB, RDS, and S3
  • Experience building and maintaining cloud-native applications
  • Experience using DevOps tools in a cloud environment, such as Ansible, Artifactory, Docker, GitHub, Jenkins, Kubernetes, Maven, and SonarQube
  • Experience using monitoring solutions like CloudWatch, Prometheus, Datadog
  • Experience writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform
  • Experience with one or more public cloud platforms like AWS, GCP, Azure
  • Experience with one or more automation tools like Terraform, Puppet, Ansible
  • Ability to provide technical direction and leadership to internal teams and clients
  • Experience with high volume, mission critical applications and their interdependencies with other applications and databases
  • Ability to leverage Splunk and Dynatrace to identify and troubleshoot issues
  • Experience with Agile delivery and tools, including the Kanban framework
  • Working knowledge of DevOps toolchains and CI/CD
  • Experience with high volume, mission critical applications, and building upon messaging and/or event-driven architectures
  • Experience with container platforms such as Docker and Kubernetes
  • Command over architecture, design, and business processes
  • Keen understanding of financial and budget management, control and optimization of Public Cloud expenses
  • Expertise in working in large, collaborative teams to achieve organizational goals
  • Passionate about building an innovative culture
  • Experience with production support of highly available applications
  • Experience with system performance monitoring and operational capacity management
  • Ability to articulate a technical strategy to senior management in clear, concise, understandable terms
  • Strong communication and collaboration skills

Preferred Qualifications
  • Bachelor's degree in computer science or other technical, scientific discipline
  • Experience with distributed storage technologies like NFS, HDFS, S3 as well as dynamic resource management frameworks (Mesos, Kubernetes)
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • SRE mindset, culture, and approach: running better production systems by creating engineering solutions to operational problems
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • Experience with Ansible and other DevOps tools is an added advantage

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as any mental health or physical disability needs.

The health and safety of our colleagues, candidates, clients and communities has been a top priority in light of the COVID-19 pandemic. JPMorgan Chase was awarded the "WELL Health-Safety Rating" for all of our 6,200 locations globally based on our operational policies, maintenance protocols, stakeholder engagement and emergency plans to address a post-COVID-19 environment.

As a part of our commitment to health and safety, we have implemented various COVID-related health and safety requirements for our workforce. Employees are expected to follow the Firm's current COVID-19 or other infectious disease health and safety requirements, including local requirements. Requirements include sharing information including your vaccine card in the firm's vaccine record tool, and may include mask wearing. Requirements may change in the future with the evolving public health landscape. JPMorgan Chase will consider accommodation requests as required by applicable law.

Equal Opportunity Employer/Disability/Veterans
