RL Research Engineer

Meril

British Indian Ocean Territory

Accepting Applications Full-time On-site
Posted 2 weeks, 2 days ago 3 views 0 applications
Job Description
**Job Title:** RL Research Engineer (Planning \& Control) **Location:** Vapi, Gujarat **Employment Type:** Full\-Time **Overview** We are seeking a highly skilled **Reinforcement Learning (RL) Research Engineer** specializing in planning and control. The role focuses on designing learning\-based planners and policies (RL, imitation learning, model\-based) and integrating them with classical control approaches to enable safe, efficient, and robust autonomous operation across multiple domains including humanoids, AGVs, cars, and drones. **Key Responsibilities** * Develop and train policies from human demonstrations and teleoperation data. * Implement safe reinforcement learning approaches with constraints. * Design long\-horizon planners using world models and uncertainty\-aware control. * Implement safety shields, fallback controllers, and verify\-before\-deploy pipelines. * Collaborate with cross\-functional teams to integrate RL policies with control systems. * Conduct sim\-to\-real transfer and ensure policies generalize in real\-world settings. * Design reward functions and implement offline RL and behavioral cloning strategies. **Must\-Haves** * 4–8\+ years of experience in RL and control systems. * Strong expertise in Model Predictive Control (MPC), Control Barrier Functions (CBFs), reachability analysis, or similar methods. * Master’s or PhD in Robotics, Control, AI, or a related field. * Experience with sim\-to\-real transfer, reward design, offline RL, and behavioral cloning. **Nice\-to\-Haves** * Experience with multi\-agent reinforcement learning. * Knowledge of hierarchical options and diffusion policies. * Familiarity with long\-horizon task planning in complex environments. **Success Metrics** * Task success rate in target domains. * Rate of human or system interventions during execution. * Compliance with energy, jerk, and other control limits. * Minimization of constraint violations in real\-world deployment. **Domain Notes** **Humanoids:** \- Stable locomotion and bimanual task RL. **AGVs (Autonomous Ground Vehicles):** \- Navigation in mixed human zones, traffic rule compliance, and aisle etiquette. **Cars:** \- Interactive merges, handling unprotected turns, and safe navigation in dynamic traffic. **Drones:** \- Wind\-robust flight, safe landing and perching maneuvers. **Application Instructions** Interested candidates may apply by sending their resume and cover letter to **parijat.patel@merai.co** with the subject line: **“RL Research Engineer (Planning \& Control) Application”** .
Login to Apply

Don't have an account? Register

About Company
Share this job