Senior Site Reliability Engineer Job at Glocomms, San Jose, CA

Y3FpaXFQWCtWRlE5VDJPUStCc3pwaURSRmc9PQ==
  • Glocomms
  • San Jose, CA

Job Description

Site Reliability Engineer

At the intersection of machine learning and large-scale infrastructure, the SRE team for our Applied Machine Learning group is redefining how intelligent systems operate at global scale. We blend the principles of software engineering with systems reliability to keep our AI and recommendation systems resilient, high-performing, and ever-evolving.

As a Site Reliability Engineer on this team, you'll be hands-on with some of the most advanced AI technologies, helping architect, maintain, and scale machine learning platforms that serve millions-if not billions-of users. You'll also play a critical role in optimizing system performance, making hardware and capacity recommendations, and automating everything possible.

What You'll Do:

  • Ensure our ML systems run smoothly, efficiently, and reliably-no matter how complex or large they get.

  • Dive deep into the guts of distributed systems to identify and resolve bottlenecks before they become outages.

  • Contribute to and lead the automation of infrastructure, pipelines, and operational routines.

  • Collaborate with engineering and hardware teams on capacity planning, architecture choices, and performance tuning.

What You Bring:

  • Deep knowledge of distributed systems and the experience to troubleshoot them with precision.

  • A Bachelor's or Master's in Computer Science or a closely related field focused on software development or systems engineering.

  • Solid programming chops in at least one of the following: Python, C/C++, or Go.

  • Strong foundation in algorithms, data structures, and computer science fundamentals.

Preferred Extras:

  • Experience designing and operating high-scale, high-availability systems.

  • Passion for writing clean, optimized code and automating away manual tasks.

  • Prior SRE experience in large distributed production environments.

Job Tags

Similar Jobs

PRIDE Health

Travel Oncology Pharmacist - $2,999 per week Job at PRIDE Health

 ...PRIDE Health is seeking a travel Pharmacist for a travel job in Pittsfield, Massachusetts. Job Description & Requirements ~ Specialty: Pharmacist ~ Discipline: Allied Health Professional ~ Start Date: 08/18/2025~ Duration: 13 weeks ~40 hours per week... 

Envoy Air Inc.

Part-Time Ramp and Customer Service Agent Job at Envoy Air Inc.

Overview: Come and work for Envoy Air, an American Airlines Group Company, at Fort Smith Regional Airport and watch your career take off! You will join a stable, FUN, secure, and fast-growing team committed to providing outstanding customer service. We are hiring... 

Yexgo

Entry Level Data Entry Clerk (100% Remote) Job at Yexgo

 ...Job Description We are seeking a detail-oriented and organized individual to join our team as an Entry Level Data Entry Clerk. As a Data Entry Clerk, you will be responsible for inputting and maintaining accurate data into our systems. This is a part-time remote... 

Cogent Infotech

Java Developer - Recent Graduates - Entry Level Positions Job at Cogent Infotech

Are you ready to change your career trajectory to Tech? This is an opportunity to interview for and join an Entry-Level Full Stack Java Development Team, completely beginner friendly ! Cogent Infotech is on the lookout for dynamic individuals with an interest in... 

Rodale Institute

Research Farm Manager Job at Rodale Institute

Rodale Institute is hiring a Research Farm Manager to oversee farm operations at our regional resource center in Rockport, WA. Rodale...  ...directly to the Research Director. Applicants should be driven, hard-working, and committed to regenerative organic agricultural research....