[Remote] AI Learning Systems Engineer

Work from home Full-time role Hiring

Note: This is a remote job open to candidates in the USA. The company is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. It is looking for a skilled AI Learning Systems Engineer to design, train, and deploy RL-based systems for high-impact decision-making problems where supervised learning alone is insufficient.

Responsibilities

Design and implement reinforcement learning solutions for sequential decision-making problems in real-world and simulated environments
Build, calibrate, and maintain simulation environments suitable for large-scale agent training
Implement and evaluate modern RL algorithms including policy gradient, actor-critic, off-policy, and offline RL methods
Engineer reward functions and shaping strategies that align agent behavior with desired outcomes and safety constraints
Apply offline RL and imitation learning techniques where exploration is costly or unsafe
Use RLHF, DPO, and related techniques for fine-tuning large language models where relevant
Build scalable training infrastructure for distributed RL, including efficient experience collection and replay systems
Optimize training stability and sample efficiency through algorithmic and engineering improvements
Design rigorous evaluation protocols, including out-of-distribution and adversarial test cases
Implement safety mechanisms such as constraint enforcement, conservative policies, and human-in-the-loop oversight
Collaborate with applied scientists and product teams to identify high-value RL use cases
Monitor deployed policies and models in production for drift, regression, and unintended behaviors, building the alerting and dashboards that surface issues before they meaningfully affect users
Document methodology, design decisions, and operational characteristics for internal stakeholders
Stay current with RL research and translate promising techniques into production-ready solutions

Skills

Master's or PhD in Computer Science, Machine Learning, or a related field; or equivalent applied experience
Six or more years of combined RL research and engineering experience
Strong proficiency in Python and modern deep learning frameworks
Hands-on experience with at least one major RL library or in-house RL stack
Solid understanding of probability, optimization, and the theoretical foundations of RL
Experience designing and tuning reward functions in non-trivial environments
Familiarity with simulation environments and large-scale experience collection
Experience training neural network policies on GPU clusters
Strong written and verbal communication skills
Track record of shipping or publishing impactful RL work
Experience with RLHF for large language models
Familiarity with multi-agent RL or hierarchical RL
Exposure to robotics, control systems, or autonomous driving
Publications in RL or related research venues
Open-source contributions to RL libraries or environments

Benefits

100% remote
Full-time
Direct W2 position with the company
Support for H1B transfers for eligible candidates

Company Overview

The company is an information technology company that offers software development, AI, and cybersecurity services. It was founded in 2020 and is headquartered in Bridgewater, New Jersey, USA, with a workforce of 51-200 employees. Its website is https://bvteck.com.

Company H1B Sponsorship

The company has a track record of offering H1B sponsorships: 3 in 2026, 41 in 2025, 14 in 2024, 7 in 2023, 12 in 2022, and 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.
