Learning to Plan with Personalized Preferences

Manjie Xu*, Xinyi Yang*, Wei Liang✉, Chi Zhang✉ , Yixin Zhu✉
(* equal contribution, ✉ corresponding authors)

Paper (arXiv) | Project Page

Effective integration of AI agents into daily life requires them to understand and adapt to individual human preferences, particularly in collaborative roles. We develop agents that not only learn preferences from a few demonstrations but also adapt their planning strategies to those preferences.

Checklist

  • ✅ Environment Setup
  • ✅ Benchmark Generation
  • ✅ Baseline Implementation
  • 🕳️ Planning & Demo Generation
  • 🕳️ Robot Interface

Code Structure

PbP
├── baselines          # Code for replicating baselines
│     ├── EILEV     
│     ├── GPT      
│     ├── Llama3        
│     ├── LLaVA-NeXT  
│     ├── opt
│     └── run_gui.py
├── OmniGibson         # The simulation platform built on NVIDIA Omniverse
├── benchmark
│     ├── draw  
│     ├── examples      
│     ├── level_0      # examples of the pre-defined level_x preferences
│     ├── level_1  
│     ├── level_2
│     ├── ...  
│     ├── action.py    # definitions of the primitive actions
│     ├── scene_config.yaml    # config of the scene in Omniverse
│     └── video_recoder.py     # recorder for the demo samples
├── LICENCE
├── README.md
└── setup.py

Install

First, install NVIDIA Omniverse and OmniGibson by following these tutorials:

NVIDIA Omniverse: https://docs.omniverse.nvidia.com/install-guide/latest/index.html  
OmniGibson: https://behavior.stanford.edu/omnigibson/getting_started/installation.html

Our code is developed on Isaac Sim 2023.1.1. Following OmniGibson's requirements, we recommend a machine with 32 GB+ RAM and an NVIDIA RTX 2070 or better. The code can run on a headless machine via Docker, but we recommend using a machine with a display.

After successfully installing OmniGibson, test some of the demos:

# simple navigation
python benchmark/examples/nav_demo.py
# pick and place
python benchmark/examples/pick_and_place.py
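
If you want a sanity check beyond the bundled demos, a minimal script along these lines should exercise the simulator. This is a sketch against the standard OmniGibson API; the scene and robot choices are illustrative, and step()'s return signature varies across OmniGibson versions, so we don't unpack it here.

```python
import omnigibson as og

# Minimal config: one interactive scene and one robot with RGB observations.
# Scene model ("Rs_int") and robot type ("Fetch") are illustrative choices.
cfg = {
    "scene": {"type": "InteractiveTraversableScene", "scene_model": "Rs_int"},
    "robots": [{"type": "Fetch", "obs_modalities": ["rgb"]}],
}

env = og.Environment(configs=cfg)
env.reset()

# Drive the robot with random actions for a few frames to confirm
# that physics and rendering are working.
for _ in range(100):
    env.step(env.action_space.sample())

og.shutdown()  # assumption: this clean-shutdown helper exists in your version
```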

Benchmark

We define human preferences at three different levels and provide code to sample demos of these preferences in different scenes. You can also modify the sampling code to generate more customized demos.

cd benchmark/level_x && python sample.py

where sample.py contains the sampling loop, preference.py implements the main preference logic, and task.py hosts the task environment.
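
As a rough picture of how these pieces fit together, here is a hypothetical sketch of the sampling structure just described. Every name in it (Preference, sample_episode, the task and recorder interfaces) is illustrative, not the repo's actual API; the real interfaces live in each level's preference.py and task.py.

```python
# Hypothetical sketch; names are illustrative, not the repo's actual API.
import random

class Preference:
    """Encodes one preference: scores candidate actions in a given state."""
    def choose(self, state, candidates):
        # e.g., prefer actions that place objects in a favored receptacle
        return max(candidates, key=lambda a: self.score(state, a))

    def score(self, state, action):
        return random.random()  # placeholder scoring

def sample_episode(task, preference, recorder, max_steps=200):
    """One rollout: the agent acts under the preference while being recorded."""
    state = task.reset()
    for _ in range(max_steps):
        candidates = task.available_actions(state)   # primitive actions
        action = preference.choose(state, candidates)
        state, done = task.step(action)
        recorder.log(state, action)                  # frame + annotation
        if done:
            break
    recorder.save()
```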

We also provide a subset of our sampled dataset on Hugging Face. (Note: this dataset is unfiltered and may include some crashed demos caused by rendering instability in OmniGibson. We recommend performing the rendering process independently for optimal results.)
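
To pull the subset locally, the standard huggingface_hub client should suffice; the repository ID below is a placeholder, since the link above is the authoritative reference.

```python
from huggingface_hub import snapshot_download

# Placeholder repo id -- substitute the dataset linked above.
local_dir = snapshot_download(repo_id="<org>/<pbp-subset>", repo_type="dataset")
print("Dataset downloaded to", local_dir)
```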

A demonstration includes an egocentric video of agent activity, a bird's-eye-view map tracking agent position, and frame-level action annotations. We also provide third-person view recordings for enhanced visualization.
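Concretely, one demonstration might be laid out as below; the file names and annotation schema here are illustrative, not the dataset's exact format:

```
demo_0001/
├── ego.mp4             # egocentric video of the agent's activity
├── bev.mp4             # bird's-eye-view map tracking the agent's position
├── third_person.mp4    # optional third-person recording for visualization
└── actions.json        # frame-level annotations, e.g.
                        # [{"frame": 0, "action": "navigate_to(fridge)"}, ...]
```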

We prioritize the egocentric perspective for two reasons: 1) it offers a clear view with minimal occlusions, and 2) it aligns with human perception, facilitating transfer to real-world data from head-mounted devices.

Baselines

We provide all our baseline implementations in the baselines folder. For each baseline, we recommend creating an independent conda environment to avoid dependency conflicts. Navigate to each folder for its install steps.
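
For example (the environment name and Python version are illustrative, and whether a folder ships a requirements.txt is an assumption; check each baseline's own instructions):

```bash
# one environment per baseline, e.g. for LLaVA-NeXT
conda create -n pbp-llava-next python=3.10 -y
conda activate pbp-llava-next
pip install -r baselines/LLaVA-NeXT/requirements.txt  # if provided
```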

Acknowledgments

NVIDIA Omniverse
OmniGibson

Cite us

@article{xu2024preference,
  title={Learning to Plan with Personalized Preferences},
  author={Xu, Manjie and Yang, Xinyi and Liang, Wei and Zhang, Chi and Zhu, Yixin},
  year={2024}
}
