Early access / Euro-Tech Hackathon

Robots that learn the job themselves.

ReflexOS turns a robot arm into an MCP server. An AI agent operates it, watches what happens, recovers from its own mistakes, and saves what works as a reusable reflex, so a new task needs far less teleoperation and engineering.

Human-defined goals & safety limitsBuilt for logistics, ports, warehouses & manufacturing
agent / reflexos-mcp
live session
tools: get_state / camera / move_joint / grip / verifyexplore -> verify -> save

Built for the places where every workflow is a little different

Logistics
Ports
Warehouses
Manufacturing
The bottleneck

Teaching a robot a new task is still slow, costly, and human-bound.

The real cost is rarely the hardware. It is the time and expertise needed to adapt the robot to each new environment, object, and workflow. Today that adaptation looks like one of these, usually all of them.

01

Human teleoperation

An operator drives the arm by hand for hours so the system has something to imitate.

02

Leader-follower demonstrations

Every motion is shown on a second arm, then replayed and hand-tuned until it holds.

03

Simulation datasets

Engineers build and label synthetic scenes that still break the moment reality differs.

04

Endless correction loops

Policies are retuned, more data is collected, the robot is retrained, and the cycle repeats.

Warehouse engineer manually operating a robot arm with a teach pendant
01

Human teleoperation

Engineer wearing a motion-capture exoskeleton to demonstrate robot movements
02

Leader-follower demonstrations

Massive array of simulated robots with sim-to-real transfer visualization
03

Simulation datasets

Robotics engineer frustrated after thousands of failed training episodes
04

Endless correction loops

The shift

From demonstration-first to exploration-first.

A human still owns the objective and the boundaries. The agent does the trial, the correction, and the workflow discovery.

Engineer wearing a motion-capture exoskeleton to guide a robot arm by hand

Demonstration-first / today

  • A human demonstrates every task by hand
  • Motions are hardcoded into fixed trajectories
  • A changed object position breaks the workflow
  • New tasks mean more demos and more engineering
AI training system autonomously reaches 91.3% success rate
91.3% success rate

Exploration-first / ReflexOS

  • A human defines the goal and the safety limits
  • The agent explores the robot's real action space
  • Failures are diagnosed, corrected, and retried
  • What works is saved as a reusable reflex
The AI-supervised training loop

Operate, observe, correct, remember.

The loop runs with the agent in control of the trial and error. It keeps tightening until a workflow is reliable enough to become a reflex.

01

Expose the robot as tools

Joints, camera, gripper, movement, state, and safety limits all become MCP tools the agent can call.

02

Give the agent a goal

A plain objective: pick, place, sort, inspect, or recover. No trajectory, no script.

03

Let the agent operate

The agent inspects state, tests possible actions, and drives the arm in a real or simulated scene.

04

Observe the result

Camera and sensor feedback confirm whether the action actually succeeded or failed.

05

Reason and correct

On failure the agent explains why, adjusts the grasp or approach, and tries a better strategy.

06

Record successful behavior

The movement, rationale, sensor state, and outcome are saved as robot memory.

07

Convert memory into reflexes

Repeated successful workflows stop needing reasoning and replay as fast, reliable skills.

08

Reduce training time

Each future workflow needs fewer human demonstrations and less engineering intervention.

The loop repeats: every recovery makes the next attempt cheaper.
Live 3D simulation

Mission control, not a mock console.

The animated console above shows the story. The live demo puts you in a 3D warehouse scene with a robot arm, camera view, and agent panel — the same training loop, rendered in real time.

  • 6-DOF arm + gripper
  • Live camera feed
  • Agent tool calls
What the agent works with

One standard interface for the whole robot.

MCP turns the robot into something an agent can read and reason about, not a black box behind a custom SDK.

The robot's body, as tools

Camera, state, joints, gripper, movement, home position, and safety limits are exposed as callable MCP tools. Each action becomes part of an action space the agent understands.

get_state()joint angles, forces
camera.capture()scene frame
move_to(pose)constrained motion
grip.open / close()actuation
verify(goal)success | failure
home() / limits()safe pose, bounds

Agent-driven exploration

The agent inspects state, sees which movements are possible, tries positions, watches outcomes, and corrects, instead of waiting for a human to demonstrate.

Memory becomes reflexes

A successful trajectory, its rationale and sensor state are stored. Seen again, the workflow replays as a fast reflex instead of reasoning from scratch.

Cross-robot skill transfer

A skill is a workflow, not a fixed motion path. If a new arm exposes equivalent tools, the agent retests and saves a robot-specific reflex.

Synthetic-to-real correction

The agent compares the simulated plan with the real outcome, finds where it broke, and records the physical correction as reusable memory.

Human-owned boundaries

People define the objective and the safety envelope. Joint and force limits stay enforced while the agent does the trial and error.

Use case / warehouse sorting

A sorting robot that trains itself on the line.

Traditionally, engineers collect demonstrations, program fixed motions, test edge cases, and hand-correct failures. Move the package or miss the grip, and the workflow breaks.

With ReflexOS the arm connects as an MCP server. The agent sees the package, checks joint state, tests a grasp, verifies the pick, and places it. When it misses, it does not stop. It reasons about the failure, tries a new grasp, and saves the recovery.

reflex sort_to_left_binlearned
  • How to approach the object
  • Which grasp angle actually works
  • Which joint sequence stays safe
  • How to verify the object was picked
  • How to recover from a missed grasp
  • How to place it in the correct bin
built from real attempts: 4 steps, 1 recovery, 0 human demos
Business value

Cut the cost of putting robots to work.

By moving trial, correction, and discovery onto the agent, ReflexOS targets the expensive parts of every deployment. These are the outcomes the system is built to deliver as it matures.

Human teleoperation hours
Engineering setup time
Task-specific programming
Retraining cycles
Failed deployments
Downtime caused by edge cases
The cost of adapting robots to new sites
The long-term vision is a new training layer for robot workers, where robots are not reprogrammed for every workflow but learn through AI-guided operation, memory, and reflex formation.
Questions

Answers before you connect an arm.

Any arm that can expose its camera, state, joints, gripper, movement, home position, and safety limits as MCP tools. The interface is what matters, not the specific brand of hardware.

Early access

Let the agent do the training.

Connect a robot as an MCP server, set a goal and safety limits, and watch it learn the workflow, recover from failures, and turn what works into reflexes.

No spam. Just an invite when your hardware can connect.