Incident IQ is seeking a DevOps Engineer to optimize and scale their cloud-native infrastructure on Microsoft Azure, focusing on Kubernetes and automation tools.
Company Overview

Atlanta-based Incident IQ is a SaaS service management platform built exclusively for K-12 schools that is transforming K-12 workflows, including IT asset management, help desk ticketing, facilities maintenance solutions, Human Resources service delivery, and more. Our mission is to revolutionize how school districts manage operational support activities to better serve students and drive instructional efficiencies.

Incident IQ is a dynamic, fast-growing company focused on providing innovative cloud-based software. The Incident IQ platform has been rapidly adopted by K-12 school districts. Today, millions of students and teachers in districts across the U.S. rely on the Incident IQ platform to manage and deliver mission-critical services. Since the company's founding, Incident IQ has built a culture focused on customer success and product leadership; we are passionate about helping school districts achieve operational efficiency. Incident IQ’s environment is inclusive and transparent, and our team members are respected and valued contributors who consistently exhibit openness, integrity, collaboration, enthusiasm, and effort.

The DevOps Engineer is an experienced individual contributor who thrives in a fast-paced, product-focused environment. The ideal candidate will help us scale and optimize our cloud-native infrastructure on Microsoft Azure, bringing deep expertise in Kubernetes, GitOps, and observability tooling like Prometheus, Grafana, and Loki. You’ll play a critical role in driving automation, reliability, and developer velocity across the engineering organization.

Responsibilities:
Lead the design, implementation, and scaling of infrastructure in Azure using Azure Kubernetes Service (AKS) and related services.
Develop and maintain GitOps workflows (e.g., ArgoCD or Flux) to automate and manage application deployments and cluster configurations.
Build and maintain observability systems using Prometheus, Grafana, and Loki for metrics, visualization, and logging.
Drive CI/CD pipeline automation and infrastructure-as-code using tools such as Terraform, Helm, and GitHub CI/CD.
Work cross-functionally with development teams to support feature delivery, deployment strategy, and runtime performance.
Champion DevOps best practices including IaC, shift-left security, and proactive monitoring.
Help ensure infrastructure scalability, availability, and resilience as we grow.
Participate in incident response, post-mortems, and on-call rotations.
Drive the evolution of our developer experience by designing, implementing, and maintaining scalable and repeatable development environments that empower engineering teams to build and ship software with speed and confidence.
Mentor team members and advocate for DevOps culture and continuous improvement.

Requirements:
8+ years of hands-on DevOps or SRE experience, with significant time spent in cloud-based environments, preferably Azure.
Strong production experience with Kubernetes (ideally AKS) and container orchestration.
Proficiency in Azure services such as Azure Resource Manager (ARM), Azure Monitor, Azure Networking, and Azure DevOps or equivalent.
Deep knowledge of Prometheus, Grafana, and Loki for monitoring and logging.
Demonstrated experience with GitOps configuration using ArgoCD.
Expertise with Terraform, Helm, and infrastructure-as-code approaches.
Strong scripting and automation skills (e.g., Python, Bash, PowerShell).
Solid understanding of networking, Linux systems, and cloud security principles.

Qualifications:
Experience in high-growth startup or scale-up environments.
Familiarity with Azure cost optimization, scalability patterns, and high-availability architecture.
Exposure to service mesh (e.g., Istio, Linkerd) and secrets management tools (e.g., HashiCorp Vault, Azure Key Vault).
Knowledge of compliance frameworks (SOC 2, HIPAA, ISO 27001, etc.).
Experience automating the creation of ephemeral development environments.

Soft Skills:
Self-starter comfortable with ambiguity and fast iteration.
Strong communication and collaboration skills.
Able to lead by example and mentor across engineering teams.
Passionate about automation, observability, and delivering value quickly.

What makes Incident IQ different:
We facilitate whole-person growth where employees can develop personally as well as professionally.
We offer an energetic and collaborative environment; everyone’s opinion matters!
We produce software that empowers K-12 schools to run efficiently, allowing for a better classroom experience for students to THRIVE!
We provide excellent work/life balance.
We have two amazing offices: one in Downtown Atlanta and one at Halcyon in Alpharetta!

Incident IQ offers a competitive salary based on experience with a benefits package for full-time employees that includes medical, dental, vision, life insurance, 401(k) match, and paid time off (PTO).

Incident IQ is an Equal Opportunity Employer.