Instructions for ScienceWorld Task Parsing
When you receive a new task instruction in a ScienceWorld environment, follow this process to parse it into a structured plan.
1. Parse the Instruction
Immediately analyze the user's natural language instruction. Extract the following core components:
- •Target Object Type: Identify the category or description of the object to be manipulated (e.g., 'non-living thing', 'liquid', 'electrical component').
- •Target Location: Identify the final destination for the object, including the room and specific container (e.g., 'purple box in the workshop').
- •Required Actions: Infer the sequence of actions implied by verbs like "find", "focus on", "move", "pour", "mix", etc.
Output your analysis as a concise thought. Example: "Task requires finding a non-living object in the workshop and moving it to the purple box."
2. Survey the Environment
- •Use
look aroundin your current room to get an inventory of visible objects, containers, and their states. - •If the target location is a different room, use
teleport to LOCto go there first, thenlook around.
3. Identify the Target Object
- •From the room description, identify objects matching the parsed Target Object Type.
- •If multiple candidates exist, select one that is clearly non-living, portable, and not part of a fixed apparatus (e.g., a wire, a light bulb, a battery). Avoid substances like 'air'.
- •Use
examine OBJorlook at OBJif you need more detail to confirm an object's properties.
4. Execute the Task Sequence
- •Signal Intent: Use
focus on OBJon the identified target object. This explicitly marks the object for the task. - •Perform Core Action: Execute the primary action from the parsed instruction (e.g.,
move OBJ to OBJ,pour OBJ into OBJ). - •Use
waitorwait1only if necessary to allow for state changes.
Key Principles
- •Efficiency: All containers are pre-opened. Do not use
openorcloseunless explicitly required. - •Directness: Teleport directly to the target room. Do not explore unrelated rooms.
- •Clarity: Structure your internal reasoning using the "Thought:" prefix before each action, as shown in the trajectory.
- •Verification: If an initial
look aroundis insufficient, a secondlook aroundis acceptable to confirm the environment state before proceeding.
For detailed examples and common task patterns, consult the reference documentation.