*** Applicants must currently reside in the Omaha metro area***
As a Site Reliability Engineer, your primary responsibility will be to review, optimize, and complete the monitoring and alerting systems for our applications. You will work closely with development, operations, and product teams to ensure that our monitoring systems provide clear, actionable data and that our alerting mechanisms are finely tuned to detect issues before they impact our customers. Your work will be pivotal in transforming raw data into actionable intelligence, improving system observability, and enhancing the overall user experience.
RESPONSIBILITIES AND DUTIES:
KNOWLEDGE, SKILLS, AND ABILITIES:
Strong experience with monitoring and observability tools (e.g., Nagios, Prometheus, Grafana, ELK Stack, Datadog, New Relic).
Proficiency in scripting languages (e.g., Python, Bash, PowerShell) for automation.
Familiarity with cloud platforms (AWS, Azure, GCP) and hybrid cloud environments.
Understanding of infrastructure-as-code tools (e.g., Terraform, Ansible).
Knowledge of CI/CD pipelines and version control systems (e.g., Git, Jenkins).Basic understanding of networking, security, and system administration.
EDUCATION AND EXPERIENCE:
Bachelor's degree in Computer Science, Engineering, a related field, or equivalent experience.
Minimum of 3 years of experience in a Site Reliability Engineering or similar role, with a focus on monitoring and alerting in a SaaS environment.
WORK ENVIRONMENT AND PHYSICAL DEMANDS:
Normal office environment with use of computers and telephone systems; no unusual physical demands
Travel as needed, including business air travel and car rental
Date Posted | November 8, 2024 |
---|---|
Date Closes | January 7, 2025 |
Located In | Omaha, NE |
Job Type | Full-time Employee |
Compensation | Salary, Varies |
Shift | Custom |
SOC Category | 15-1133.00 Software Developers, Systems Software |
Zipcode | 68154 |
This job is related other jobs in these career categories