Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust. Job Summary: Anthropic is an AI research company focused on the safety and alignment of AI systems with human values. They are seeking a Machine Learning Infrastructure Engineer to build and scale the infrastructure that powers their AI safety systems, ensuring that models operate safely and reliably. Responsibilities: • Design and build scalable ML infrastructure to support real-time and batch classifier and safety evaluations across our model ecosystem • Build monitoring and observability tools to track model performance, data quality, and system health for safety-critical applications • Collaborate with research teams to productionize safety research, translating experimental safety techniques into robust, scalable systems • Optimize inference latency and throughput for real-time safety evaluations while maintaining high reliability standards • Implement automated testing, deployment, and rollback systems for ML models in production safety applications • Partner with Safeguards, Security, and Alignment teams to understand requirements and deliver infrastructure that meets safety and production needs • Contribute to the development of internal tools and frameworks that accelerate safety research and deployment Qualifications: Required: • 5+ years of experience building production ML infrastructure, ideally in safety-critical domains like fraud detection, content moderation, or risk assessment • Proficient in Python and have experience with ML frameworks like PyTorch, TensorFlow, or JAX • Hands-on experience with cloud platforms (AWS, GCP) and container orchestration (Kubernetes) • Understand distributed systems principles and have built systems that handle high-throughput, low-latency workloads • Experience with data engineering tools and building robust data pipelines (e.g., Spark, Airflow, streaming systems) • Results-oriented, with a bias towards reliability and impact in safety-critical systems • Enjoy collaborating with researchers and translating cutting-edge research into production systems • Care deeply about AI safety and the societal impacts of your work • At least a Bachelor's degree in a related field or equivalent experience Preferred: • Working with large language models and modern transformer architectures • Implementing A/B testing frameworks and experimentation infrastructure for ML systems • Developing monitoring and alerting systems for ML model performance and data drift • Building automated labeling systems and human-in-the-loop workflows • Experience in trust & safety, fraud prevention, or content moderation domains • Knowledge of privacy-preserving ML techniques and compliance requirements • Contributing to open-source ML infrastructure projects Company: Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values. Founded in 2021, the company is headquartered in San Francisco, California, USA, with a team of 501-1000 employees. The company is currently Late Stage. Anthropic has a track record of offering H1B sponsorships.