
Hands-On: Basic VLA Agent (Design & Lab Plan)

This is a lab plan, not a full implementation. Use it as a handout to run a supervised demo or to implement later in a separate repo; the short code sketches after each step are illustrative only.

Goal

The robot receives a high-level text command (e.g., "move to the red block"), identifies the target via perception, generates a plan, and issues motion commands. Each step is described conceptually below.

Lab Steps

  1. Define inputs & outputs:

    • Input: natural language command text.
    • Perception: a list of detected objects with positions.
    • Output: motion intent / velocity commands (schemas sketched below).
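
A minimal sketch of these three schemas, assuming plain Python dataclasses; every field name here is an illustrative choice, not something the plan prescribes:

```python
from dataclasses import dataclass

@dataclass
class Command:
    text: str                  # raw natural-language command

@dataclass
class DetectedObject:
    name: str                  # e.g., "red block"
    position_x: float          # metres, robot frame (assumed)
    position_y: float
    confidence: float          # detector score in [0, 1]

@dataclass
class MotionIntent:
    linear_x: float            # forward velocity, m/s
    angular_z: float           # yaw rate, rad/s
    stop: bool = False         # safe-stop flag
```
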
  2. Language stage (design):

    • Outline how to parse commands into (action, object) pairs (a minimal parser sketch follows this step).
    • Specify ambiguity-handling rules and a confirmation policy.
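
One way the parser could look, assuming a small fixed vocabulary; the ACTIONS and OBJECTS sets and the None-means-confirm convention are assumptions for illustration:

```python
from typing import Optional, Tuple

ACTIONS = {"move to", "go to", "approach"}      # assumed action vocabulary
OBJECTS = {"red block", "blue block", "ball"}   # assumed known objects

def parse_command(text: str) -> Optional[Tuple[str, str]]:
    """Return an (action, object) pair, or None when the command is ambiguous."""
    lowered = text.lower()
    action = next((a for a in ACTIONS if a in lowered), None)
    target = next((o for o in OBJECTS if o in lowered), None)
    if action is None or target is None:
        return None  # ambiguous: fall back to the confirmation policy
    return (action, target)

# parse_command("Please move to the red block") -> ("move to", "red block")
```
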
  3. Perception stage (design):

    • List the sensors used and their expected data formats.
    • Describe an object-table format: name, position_x, position_y, confidence (sample rows below).
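
Sample rows in that object-table format; the values are invented purely for illustration:

```python
# Illustrative object table: one row per detection, fields as specified above.
object_table = [
    {"name": "red block",  "position_x": 1.20, "position_y": -0.35, "confidence": 0.92},
    {"name": "blue block", "position_x": 0.80, "position_y":  0.50, "confidence": 0.88},
]
```
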
  4. Planner stage (design):

    • Define the logic: select the target object → compute the relative offset → generate a simple motion primitive (sketched below).
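
A planner sketch assuming a planar robot at a known pose; the proportional gains and velocity cap are assumed values, not part of the plan:

```python
import math

def plan_motion(object_table, target_name, robot_x=0.0, robot_y=0.0):
    """Select the target, compute the offset, emit one velocity primitive."""
    # Select: highest-confidence detection matching the requested name.
    candidates = [o for o in object_table if o["name"] == target_name]
    if not candidates:
        return None  # target not visible: re-plan or ask the user

    target = max(candidates, key=lambda o: o["confidence"])

    # Relative offset from robot to target.
    dx = target["position_x"] - robot_x
    dy = target["position_y"] - robot_y

    # Simple primitive: turn toward the target while driving forward.
    distance = math.hypot(dx, dy)
    heading = math.atan2(dy, dx)
    return {"linear_x": min(0.5 * distance, 0.3),   # assumed gain, 0.3 m/s cap
            "angular_z": 1.0 * heading}             # assumed turn gain
```
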
  5. Execution & Safety (design):

    • State the checks to run before executing: path clear, joint limits respected, timeouts not exceeded.
    • Define safe-stop behavior (see the sketch after this step).
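
A sketch of the pre-execution gate and safe-stop. The three checks come from the step above; the limit values, timeout, and function signatures are assumptions:

```python
import time

JOINT_LIMITS = (-2.8, 2.8)   # rad; assumed per-joint bound
COMMAND_TIMEOUT_S = 5.0      # assumed abort threshold

def safe_to_execute(path_clear, joint_positions, start_time):
    """All three checks must pass before any motion command is sent."""
    within_limits = all(JOINT_LIMITS[0] <= q <= JOINT_LIMITS[1]
                        for q in joint_positions)
    not_timed_out = (time.monotonic() - start_time) < COMMAND_TIMEOUT_S
    return path_clear and within_limits and not_timed_out

def safe_stop():
    """Safe-stop: command zero velocity and latch the stop flag."""
    return {"linear_x": 0.0, "angular_z": 0.0, "stop": True}
```
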
  6. Evaluation:

    • Define success criteria (e.g., arrive within 0.2 m of the target), timeouts, and the number of re-plans; a scoring sketch follows.
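
A scoring sketch for one trial; the 0.2 m threshold is from the plan, while the metric names are illustrative:

```python
import math

ARRIVAL_TOLERANCE_M = 0.2  # success radius from the evaluation step

def evaluate_trial(final_x, final_y, target_x, target_y,
                   elapsed_s, timeout_s, replans):
    """Score one trial with the metrics the plan asks to record."""
    error = math.hypot(target_x - final_x, target_y - final_y)
    return {
        "success": error <= ARRIVAL_TOLERANCE_M and elapsed_s <= timeout_s,
        "final_error_m": round(error, 3),
        "elapsed_s": elapsed_s,
        "replans": replans,
    }
```
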

Deliverable (for students)

  • A one-page design spec describing each stage, message schemas, and test cases.