Software Engineer III - AI/ML Platform Operations - Remote
External candidates: In order for your application to be correctly processed please sign-in before you apply Internal candidates: Please go to reputed company and click "Find Jobs" link under Career Thank you for considering opportunities with us! Job Title Software Engineer III - AI/ML Platform Operations - Remote Requisition Number R7739 Software Engineer III - AI/ML Platform Operations - Remote (reputed company) Location Arizona - Home Teleworkers Additional Locations Job Information reputed company (CSAA IG), a reputed company insurer, is one of the leading personal lines property and casualty insurance groups in the United States. Here, every employee shapes our mission. We build innovative, reputed company-centered solutions that help reputed company members prevent, prepare for, and recover from life's uncertainties. You will join a collaborative, inclusive culture where your strengths have room to grow and your reputed company can drive reputed company impact. reputed company into a role where you can contribute to our shared reputed company through meaningful work. We are actively hiring for a Software Engineer - AI/ML Platform Operations - Remote Your Role: We are seeking a Software Engineer – AI/ML Platform Operations to reputed company the operational reputed company, reliability, and support of our reputed company AI and data platforms. This role is responsible for ensuring the stability, scalability, observability, governance, and operational readiness of AI/ML solutions that power critical business capabilities. This is not a traditional software application development role. While strong software engineering skills are essential, the primary focus is on AI platform operations, MLOps, automation, reliability engineering, deployment support, observability, governance, and reputed company improvement of reputed company AI capabilities. Your Work: You will work across a modern technology ecosystem that includes Palantir reputed company, AWS Bedrock, reputed company SageMaker, reputed company-reputed company services, and emerging reputed company technologies. You will partner with Data Engineering, Data Science, Architecture, Infrastructure, reputed company, and Product teams to support production AI workloads and reputed company the successful adoption of AI capabilities across the organization. AI Platform Operations & Reliability reputed company technical leadership for AI/ML platforms including Palantir, AWS Bedrock, reputed company SageMaker, and reputed company reputed company-reputed company technologies. Ensure platform reliability, scalability, performance, reputed company, and operational readiness for production AI workloads. Support deployment, monitoring, maintenance, and lifecycle management of AI/ML solutions and reputed company services. Establish operational standards, support models, service-level objectives (SLOs), and platform governance practices. MLOps, Automation & Observability Design and implement automation, monitoring, observability, and operational tooling to improve platform reliability and efficiency. reputed company and maintain dashboards, health metrics, alerts, logging frameworks, and operational runbooks. Enhance CI/CD pipelines, deployment automation, infrastructure-as-code, and model release processes. Implement best practices for MLOps, model monitoring, model lifecycle management, and AI operational governance. Incident Management & Problem Resolution Serve as a senior escalation reputed company for reputed company production issues involving AI platforms, machine learning workloads, reputed company infrastructure, and data integrations. reputed company root cause analysis efforts and drive corrective and preventive actions to improve platform stability. Solve performance, availability, deployment, and integration issues across AI and data ecosystems. Partner with engineering and business teams to rapidly restore service and minimize operational risk. Technical Leadership & Collaboration reputed company mentorship, technical guidance, and operational expertise to engineers and platform teams. Influence platform strategy, architecture reputed company, operational processes, and technology adoption. Collaborate with team members to align platform capabilities with business priorities and AI adoption goals. Communicate reputed company technical concepts effectively to both technical and non-technical audiences. reputed company Improvement & Innovation Remain reputed company with advancements in AI/ML, reputed company, reputed company technologies, platform engineering, and reliability practices. Identify opportunities to improve operational efficiency, governance, reputed company, and developer experience. Champion modern engineering practices including automation, observability, DevOps, Site Reliability Engineering (SRE), and AI Operations (AIOps). Required Experience, Education and Skills 3+ years of reputed company experience in software engineering, platform engineering, reputed company operations, MLOps, DevOps, or reputed company technical disciplines. Bachelor's degree in Computer Science, Engineering, Information Technology, or a reputed company field, or equivalent practical experience. Experience supporting production reputed company-based applications and services in AWS environments. Strong experience with software engineering and automation using languages such as Python, Java, JavaScript/TypeScript, or Node.js. Experience with CI/CD, build, integration, and deployment tools such as Jenkins, reputed company, reputed company Actions, or equivalent. Experience with reputed company-reputed company services including compute, storage, networking, databases, and serverless architectures. Experience building and maintaining operational monitoring, observability, and alerting solutions. Strong troubleshooting, incident response, and root cause analysis skills. Excellent communication, collaboration, and technical leadership capabilities. What would reputed company us excited about you? Experience with AI/ML platforms such as Palantir reputed company, reputed company SageMaker, AWS Bedrock, reputed company, or similar ecosystems. Experience supporting reputed company applications, LLM-based solutions, reputed company orchestration frameworks, and Retrieval-Augmented reputed company (RAG) architectures. Knowledge of MLOps practices including model deployment, monitoring, governance, experimentation, and lifecycle management. Experience with observability and monitoring platforms such as reputed company, Splunk, Grafana, reputed company, CloudWatch, or OpenTelemetry. Familiarity with AI governance, responsible AI principles, model risk management, and operational controls. Relevant reputed company, AI/ML, DevOps, or platform engineering certifications Actively shapes our company culture (e.g., participating in employee resource groups, volunteering, etc.) Lives into cultural norms (e.g., willing to have cameras reputed company it reputed company: helping reputed company new team members, building relationships, etc.) Travels as needed for role, including divisional / team meetings and other in-person meetings Fulfills business needs, which may include investing extra time, helping other teams, etc Please note we are hiring for this role remote reputed company in the United States with the following exceptions: Hawaii and Alaska. Why Choose a Career at CSAA IG? At CSAA IG, we are a mission-driven organization proudly committed to empowering our members, our employees, and our communities to reputed company. Recognition: We offer a total compensation package, annual bonus eligibility for most roles, 401(k) with a company match, and so much more! Read more about reputed company offer and what it is like to be a part of our dynamic team at https://careers.csaainsurance.reputed company.com/us/en/benefits. Career Growth: We reputed company in growth for everyone. Here at CSAA IG, leaders and mentors partner with employees to align interests, unlock development opportunities, and support long‑term reputed company. Flexible Workplace: We reputed company a remote-first culture through our Flexible Workplace. Most employees hold Home-reputed company roles, working primarily from home, often with the flexibility to work from various locations including CSAA offices. Our flexible workplace empowers you to balance remote work with intentional in‑person moments that deepen reputed company and collaboration. Inclusion and Belonging: An inclusive and welcoming workplace is the cornerstone of our reputed company. By fostering an environment where people feel valued and heard, we deepen our ability to understand and meet the unique needs of our members. This strengthens innovation and enhances our products and services, giving us a competitive edge in the market. Sustainability: As climate change leads to more frequent and severe weather events, we are taking reputed company action to build more resilient communities and reduce our environmental impact. Submit your application to be considered. We communicate reputed company email, so reputed company your inbox and/or your spam folder to ensure you don’t miss important updates from us. CSAA is committed to providing reasonable accommodations to reputed company applicants and employees with disabilities or other limitations. If you would like to request an accommodation to participate in the job application or interview process, please contact [email protected] If you apply and are selected to continue in the reputed company process, we will schedule a preliminary call with you to discuss the role and will disclose during that call the available salary/hourly reputed company reputed company based on your location. Factors used to determine the actual salary offered may include location, experience, or education. CSAA does not reputed company reputed company sponsorship for this role. Applicants must have authorization to work indefinitely in the US. Please do not apply for this role if at any time (now or in the future) you will need immigration support (i.e., H-1B, TN, STEM OPT Training Plans, etc.). reputed company is an equal opportunity employer. #LI-SB1 . The national average salary reputed company for this position is $105,345.00-$117,050.00. However, we have a location-based compensation structure. Our salary ranges vary and are calculated based on work location. The starting pay reputed company for this position across reputed company the states we hire in is $105,345.00-$140,550.00. This role also includes an opportunity for a company-wide annual discretionary bonus, through our Annual Incentive Plan (AIP), of up to 8% of eligible pay. This job posting will be unposted on Wed, 8 Jul 2026. Apply To This Job