Senior Site Reliability Engineer Job at Glocomms, San Jose, CA

Y3FpaXFQWCtWRlE5VDJPUStCc3pwaURSRmc9PQ==
  • Glocomms
  • San Jose, CA

Job Description

Site Reliability Engineer

At the intersection of machine learning and large-scale infrastructure, the SRE team for our Applied Machine Learning group is redefining how intelligent systems operate at global scale. We blend the principles of software engineering with systems reliability to keep our AI and recommendation systems resilient, high-performing, and ever-evolving.

As a Site Reliability Engineer on this team, you'll be hands-on with some of the most advanced AI technologies, helping architect, maintain, and scale machine learning platforms that serve millions-if not billions-of users. You'll also play a critical role in optimizing system performance, making hardware and capacity recommendations, and automating everything possible.

What You'll Do:

  • Ensure our ML systems run smoothly, efficiently, and reliably-no matter how complex or large they get.

  • Dive deep into the guts of distributed systems to identify and resolve bottlenecks before they become outages.

  • Contribute to and lead the automation of infrastructure, pipelines, and operational routines.

  • Collaborate with engineering and hardware teams on capacity planning, architecture choices, and performance tuning.

What You Bring:

  • Deep knowledge of distributed systems and the experience to troubleshoot them with precision.

  • A Bachelor's or Master's in Computer Science or a closely related field focused on software development or systems engineering.

  • Solid programming chops in at least one of the following: Python, C/C++, or Go.

  • Strong foundation in algorithms, data structures, and computer science fundamentals.

Preferred Extras:

  • Experience designing and operating high-scale, high-availability systems.

  • Passion for writing clean, optimized code and automating away manual tasks.

  • Prior SRE experience in large distributed production environments.

Job Tags

Similar Jobs

Civic Tax Relief

Enrolled Agent Job at Civic Tax Relief

 ...so that they get the fresh start they deserve. We pride ourselves on culture, environment, ethics and values. This full-time Enrolled Agent career opportunity has a Monday - Friday schedule Responsibilities: Calling and dealing with the IRS or State for U.S tax... 

Farm Job Search

Dairy Farm Manager Job at Farm Job Search

 ...Dairy Farm Manager (6207) Location: Iowa JobNumber: 6207 Dairy Farm Manager position immediately available on a 500 plus cow dairy in Northeastern Iowa. Owner travels extensively and requires a farm manager to oversee the operation in his absence. Must have dairy... 

INSPYR Solutions

Senior Power BI Developer Job at INSPYR Solutions

Title: Senior Power BI DeveloperLocation: Houston, TX 77056 (Hybrid: 3 days onsite / 2 days remote)Duration: Long Term ContractWork Requirements: US Citizen, GC Holders or Authorized to Work in the U.S.Job Description:We are seeking a Senior Power BI Developer to... 

Optum

Associate Patient Care Coordinator - Edgewater, NJ Job at Optum

 ...make a difference in the lives of people who turn to us for care at one of our hundreds of locations across New York, New Jersey...  ...Caring. Connecting. Growing together. The Associate Patient Care Coordinator is responsible for the completion of set processes and... 

Minnesota Department of Employment and Economic Development

Document Services Specialist - Central Services Administrative Specialist, Senior - Paid Leave Job at Minnesota Department of Employment and Economic Development

 ...team, inspired by challenging work and united by shared values that...  ...This posting may be used to fill multiple positions. Qualifications...  ...submit a copy of your DD-214 form and other required...  ...Convenience Services: Chore services, home repair, trip planning, child/elder...