Hybrid Senior Site Reliability Engineer

Job ID: 130000
Job Type: Contract
Industry: Manufacturing
Pay Rate: $70 - $75 an hour
Location: Newark, California, United States

Talascend is currently seeking a hybrid Senior Site Reliability Engineer for a W-2 contract position located in Newark, California.

HYBRID SCHEDULE:
3 days a week onsite.
2 days a week remote.

SUMMARY:

This role will contribute to OpenVPN upgrades, AMQX upgrades, and similar infrastructure work.

PRIMARY RESPONSIBILITIES:
  • Own and enhance the reliability of services deployed across various cloud regions. You will proactively monitor, automate, and scale services to ensure seamless uptime and performance.
  • Lead the containerization and deployment of microservices and data pipelines on Kubernetes, using Helm charts, ensuring best practices for scalability and fault tolerance.
  • Foster and advocate for a DevOps culture that emphasizes automation, self-service, and engineering excellence. Enable development teams to manage and deploy applications seamlessly with minimal intervention.
  • Implement autoscaling strategies and monitor the performance of applications and infrastructure with tools like Prometheus, Grafana, and other observability platforms.
  • Perform SRE tasks such as availability monitoring, incident response, post-mortem analysis, and preparing reliability reports for leadership and stakeholders.
  • Deploy, configure, and maintain essential cloud services and tools including Kafka, Spark, Presto, Airflow, MQTT, and other microservices platforms in a cloud-native environment.
  • Set up and manage cloud infrastructure using tools like Terraform, Cluster API, and other IaC frameworks, ensuring seamless provisioning, management, and scaling of resources.
  • Continuously enhance and automate alerting, incident detection, and recovery mechanisms for critical applications and services to minimize downtime and improve system reliability.
  • Participate in an on-call rotation to meet business SLAs, quickly troubleshoot and resolve issues, and document runbooks for consistent incident management processes.
  • Work closely with Product Owners, Engineering Managers, and cross-functional teams in Agile Scrum and Kanban workflows to deliver iterative improvements and meet evolving business needs.
  • Perform impact analysis during incidents, collaborate with teams for root cause analysis, and implement preventive measures to avoid recurrence.

POSITION REQUIREMENTS:
  • US citizen or Green Card holder.
  • 8+ years of experience in Site Reliability Engineering (SRE), DevOps Engineering, or related fields.
  • 7+ years of experience with cloud architecture or engineering.
  • 7+ years of experience with DevOps.
  • 7+ years of experience with Kubernetes.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
  • 4+ years of hands-on experience deploying, managing, and optimizing containerized applications using Docker and Kubernetes in both public and private cloud environments (AWS, GCP, Azure, etc.).
  • 4+ years in Infrastructure-as-Code (IaC) using Terraform, Cluster API, or similar automation frameworks to manage cloud infrastructure.
  • Experience in scripting or programming with Python, Go, Bash/Shell, or similar languages.
  • Strong understanding of using Prometheus, Grafana, and other monitoring and observability tools.
  • Ability to effectively diagnose and resolve performance bottlenecks within AWS at the infrastructure and application layers.
  • Experience with configuration management and automation tools such as Ansible, Chef, or Puppet (preferred but not required).
  • CERTIFICATIONS REQUIRED: AWS Cloud Certification or OCI Certification.

We thank all applicants for their interest. However, only those qualified individuals who closely meet the qualifications of the position will be contacted. The details of the position are only a summary; other duties may be assigned as necessary.

Drug Screen may be required.

We will consider qualified applicants with a criminal history pursuant to Federal, State and Local law, including the California Fair Chance Act, the Los Angeles County Fair Chance Ordinance and the San Francisco Fair Chance Ordinance. You do not need to disclose your criminal history or participate in a background check until a conditional job offer is made to you. After making a conditional offer and running a background check, if Talascend is concerned about a conviction that is directly related to the job, you will be given the chance to explain the circumstances surrounding the conviction, provide mitigating evidence, or challenge the accuracy of the background report. Applicants with criminal histories are encouraged to apply. To find out more, visit California Fair Chance Act.

Pay range is not a guarantee of compensation or salary, as final offer amount may vary based on factors including but not limited to experience and geographic location. Talascend also offers a variety of benefits including: health and disability insurance, 401(k), EAP, paid time off, and company-paid holidays. The specific programs and options available to an employee may vary depending on date of hire, plan requirements, schedule type, and client work site mandates.

Talascend is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.
