Site Reliability Engineer (SRE) - Chaos Job at Atlantis IT group, Ontario, CA

RUFTQmlmZlRyUGJxbitYT1llaXZDaHNka2c9PQ==
  • Atlantis IT group
  • Ontario, CA

Job Description

Site Reliability Engineer (SRE) - Chaos

Toronto

Role Description: Analyze equipment and system failure modes to prevent downtime.Develop and implement maintenance strategies.Use statistical analysis to predict system reliability and risk of failure.Collaborate with other engineers to ensure the reliability of new projects.Test and analyze parts and equipment to determine causes of malfunctions.Document and communicate reliability analysis and testing results.Design system upgrades for improved reliability and performance.Provide training and support to maintenance personnel.Create policies and procedures for inspection, maintenance, and repair methodsDesign and execute experiments to test system resilience.Identify weaknesses in systems and applications through controlled chaos.Develop strategies to improve system reliability and fault tolerance.Collaborate with development and operations teams to implement chaos engineering practices.Analyze the impact of failures and provide recommendations for improvements.Document and share findings to enhance overall system robustness


Essential Skills: Strong understanding of reliability engineering and statistical analysis.Knowledge of maintenance management and manufacturing processes.Ability to identify potential issues and develop effective solutions.Excellent communication and documentation skills.Proficiency in scripting and programming languages.Strong understanding of distributed systems and cloud infrastructure.Experience with chaos engineering tools and methodologies.Analytical mindset with the ability to identify and mitigate risks.Excellent problem-solving and communication skills.Proficiency in scripting and programming languages.Strong understanding of distributed systems and cloud infrastructure.Experience with chaos engineering tools and methodologies.Analytical mindset with the ability to identify and mitigate risks.Excellent problem-solving and communication skills.Proficiency in scripting and programming languages.Strong understanding of distributed systems and cloud infrastructure.Experience with chaos engineering tools and methodologies.Analytical mindset with the ability to identify and mitigate risks.Excellent problem-solving and communication skills.Proficiency in scripting and programming languages.Strong understanding of distributed systems and cloud infrastructure.Experience with chaos engineering tools and methodologies.Analytical mindset with the ability to identify and mitigate risks.Excellent problem-solving and communication skills.Proficiency in scripting and programming languages.Strong understanding of distributed systems and cloud infrastructure.Experience with chaos engineering tools and methodologies.Analytical mindset with the ability to identify and mitigate risks.

Job Tags

Similar Jobs

Snider Fleet Solutions

Heavy Duty Truck and Trailer SVC Tech Job at Snider Fleet Solutions

 ...components under supervision (must be a repair audited by a supervisor or senior technician).~PM Inspection, Brake job, Brake chamber and slack adjuster replacement, Wheel seal replacement and wheel end adjustment (RP618B), Trace an open or shorted circuit.~Willingness to... 

Five Star Call Centers

Customer Service Representative Job at Five Star Call Centers

 ...hero? Suit up and apply today! This is a work-at-home position for individuals living...  ...systems using dual monitors Previous remote work from home experience preferred Quick...  ...EST) ; Work Days - Mon-Fri + Every Other Weekend (Will have 2 days off during the week... 

Metro Meals on Wheels

Oferta de trabajo: Especialista en eventos Job at Metro Meals on Wheels

Especialista de eventos Resumen de trabajoAumente la conciencia sobre Meals on Wheels y los problemas de hambre para personas mayores en las Ciudades Gemelas trabajando con el Director de Marketing y Desarrollo para ejecutar y entregar la estrategia de marketing. Este... 

Veolia

Certified Water Treatment Operator I Job at Veolia

 ...North America (VNA) offers a full spectrum of water, waste and energy management services, including water, and wastewater treatment, commercial and hazardous waste collection...  ...THROUGH RECIPROCITY. NO EXCEPTIONS. The Operator I operates and maintains sludge, dewatering... 

ABB

Field Service Engineer, Propulsion Job at ABB

 ...work authorization that would permit them to work for ABB in the US More about us The Marine & Ports Division serves the shipping and ports industries through its extensive portfolio of integrated systems and solutions that improve the flexibility, reliability...