Site Reliability Engineer Job at Quadrant IQ Solutions LLC, San Antonio, TX

  • Quadrant IQ Solutions LLC
  • San Antonio, TX

Job Description

Role: Lead Site Reliability Engineer with Java

Location: San Antonio, Texas

Relevant Experience: 14+ Years

Note: We need an SRE who is very strong in Java; the interview process will include a coding round.

Job Description:

As a Lead Site Reliability Engineer (SRE), you will leverage your extensive experience in SRE practices to maintain and enhance the reliability, performance, and scalability of mission-critical systems. You will play a crucial role in ensuring the continuous availability and optimal functioning of our services.

Key Responsibilities:

  • Senior-Level SRE Expertise: Apply your deep understanding of SRE principles to lead efforts in improving system reliability and operational efficiency.
  • Incident Management: Provide expert-level support during incidents, ensuring swift resolution with minimal service disruption. Lead post-incident reviews to drive continuous improvement.
  • Monitoring & Alerting: Design, implement, and optimize monitoring, alerting, and incident response processes. Ensure the effectiveness of these systems to proactively address potential issues.
  • Automation: Drive the automation of manual processes to enhance operational efficiency, reduce human error, and increase overall system resilience.
  • CI/CD Pipeline Management: Develop, maintain, and improve automated CI/CD pipelines using tools such as GitLab CI/CD and Jenkins, ensuring seamless and reliable deployment processes.
  • Cross-Functional Collaboration: Work closely with cross-functional teams to ensure the reliability, performance, and scalability of our infrastructure. Foster a culture of collaboration and knowledge sharing.
  • Support Across Time Zones: Provide support across all U.S. time zones, with the flexibility to work weekends, rotational shifts, and overtime as required to maintain service continuity.

Required Skills & Qualifications:

  • Java Programming: Advanced proficiency in Java, with a deep understanding of contemporary software development practices.
  • Kubernetes & Containerization: Extensive hands-on experience with Kubernetes, containerization technologies such as Docker, and Kubernetes storage solutions such as Portworx.
  • Linux/Unix Systems: Strong command of Linux/Unix operating systems and Shell Scripting (BASH), with a focus on system reliability and automation.
  • Functional & Declarative Programming: Proficiency in functional programming languages such as Haskell and OCaml, and in logic programming languages such as Prolog.
  • Scripting & Automation: Experience with Python or Go, particularly in the context of scripting and automation tasks.
  • Virtualization: In-depth knowledge of VMware and other virtualization platforms, with a focus on optimizing virtual environments for reliability and performance.
  • Streaming Technologies: Expertise with Kafka Stream Generator, ksqlDB, cluster federation, and Spark Streaming, including experience in managing and optimizing streaming data architectures.
  • Service Mesh & Networking: Familiarity with Istio and Anthos Service Mesh, with the ability to manage and optimize service meshes for complex environments.
  • Performance Monitoring & Debugging: Proficiency in using eBPF (extended Berkeley Packet Filter) for performance monitoring and debugging.
  • Monitoring & Logging Tools: Experience with industry-standard monitoring and logging tools such as Splunk, Prometheus, Datadog, and Kiali.
  • Load Balancing: Familiarity with Nginx Controller and Seesaw for effective load balancing and traffic management.
  • Infrastructure-as-Code (IaC): Competence in using Terraform for managing cloud infrastructure, ensuring consistency and scalability across environments.

Additional Requirements:

  • Flexibility: Willingness to work weekends and rotational shifts, and to provide 24/7 support as necessary to maintain service reliability and meet project deadlines.
  • Certifications: Kubernetes and Azure certifications are required.

Job Tags

Remote job, Shift work, Weekend work
