Syllabus

Schedule

Readings: ● required   ○ optional

Date   Topic   Presenters
Week 1
Fri, Jan 24

Lecture Course Overview and Computer Vision Basics

Week 2
Fri, Jan 31

Lecture Robotics Basics and Machine Learning Basics

Part I: Robot Perception
Week 3
Fri, Feb 7

2D Perception

  • DINOv2: Learning Robust Visual Features without Supervision. Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski (2023) [Presenter: Maksym Bondarenko]
  • Segment Anything 2. Nikhila Ravi, Valentin Gabeur, Yuan-Ting Hu, Ronghang Hu, Chaitanya Ryali, Tengyu Ma, Haitham Khedr, Roman Rädle, Chloe Rolland, Laura Gustafson, Eric Mintun, Junting Pan, Kalyan Vasudev Alwala, Nicolas Carion, Chao-Yuan Wu, Ross Girshick, Piotr Dollár, Christoph Feichtenhofer (2024) [Presenter: Seyoung Ree]
  • Emerging Properties in Self-Supervised Vision Transformers. Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin (2021)
  • Segment Anything. Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick (2023)

3D Perception

Maksym Bondarenko
Seyoung Ree
Lingyi Zhang
Dan Harvey
Week 4
Fri, Feb 14

Representation Learning for Robotics

Model Learning for Robotics

TA Yixuan Wang
Prof. Yunzhu Li
Sammy Agrawal
Nikolaus Holzer
Shuo Sha
Week 5
Fri, Feb 21

Gaussian Splatting

Multimodal Perception

Yutao Mao
Philippe Wu
Bingyao Du
Jifeng Li
Week 6
Fri, Feb 28

Visual Tracking

Aakash Aanegola
Frank Fan
Charles Xu
Rodolfo Raimundo
Part II: Robot Decision Making

Grasping

Week 7
Fri, March 7

Visual Affordance

Imitation Learning

Jason Zou
Priyanka Varghese
Tianjun Zhong
Yolanda Zhu
Naian Tao
Week 8
Fri, March 14

Model-Based Planning

Task and Motion Planning

Hongyu Li
Kheri Hughes
Alexander Du
Feiyang Chen
Carl Gross
Week 9
Fri, March 21
No Class - Spring Recess
Week 10
Fri, March 28

Embodied AI Benchmark

  • BEHAVIOR-1K: A Benchmark for Embodied AI with 1,000 Everyday Activities and Realistic Simulation. Chengshu Li*, Ruohan Zhang*, Josiah Wong*, Cem Gokmen*, Sanjana Srivastava*, Roberto Martín-Martín*, Chen Wang*, Gabrael Levine*, Michael Lingelbach, Jiankai Sun, Mona Anvari, Minjune Hwang, Manasi Sharma, Arman Aydin, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R Matthews, Ivan Villa-Renteria, Jerry Huayang Tang, Claire Tang, Fei Xia, Silvio Savarese, Hyowon Gweon, Karen Liu, Jiajun Wu, Li Fei-Fei (2022) [Presenter: Prof. Yunzhu Li]
  • Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots. Xavier Puig, Eric Undersander, Andrew Szot, Mikael Dallaire Cote, Tsung-Yen Yang, Ruslan Partsey, Ruta Desai, Alexander William Clegg, Michal Hlavac, So Yeon Min, Vladimír Vondruš, Theophile Gervet, Vincent-Pierre Berges, John M. Turner, Oleksandr Maksymets, Zsolt Kira, Mrinal Kalakrishnan, Jitendra Malik, Devendra Singh Chaplot, Unnat Jain, Dhruv Batra, Akshara Rai, Roozbeh Mottaghi (2024) [Presenter: Adithi Narayan & Code Reporter: Yuan Fang]

Dexterous Manipulation

  • Learning Dexterous In-Hand Manipulation. OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba (2018) [Presenter: William Wang]
  • Visual Dexterity: In-hand Dexterous Manipulation from Depth. Tao Chen, Megha Tippur, Siyang Wu, Vikash Kumar, Edward Adelson, Pulkit Agrawal (2022) [Presenter: Danelle Tuchman]
Adithi Narayan
Yuan Fang
William Wang
Danelle Tuchman
Part III: Cutting-Edge Topics
Week 11
Fri, April 4

Scalable Robot Data

Program Synthesis for Embodied Agents

Charan Santhirasegaran
Ishaan Mahajan
Nico Bykhovsky
Karla Zúñiga
Week 12
Fri, April 11

Vision-Language-Action Model

  • OpenVLA: An Open-Source Vision-Language-Action Model. Moo Jin Kim, Karl Pertsch, Siddharth Karamcheti, Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, Chelsea Finn (2024) [Presenter: Tom Zollo & Code Reporter: John Wendlandt]
  • $\pi_0$: A Vision-Language-Action Flow Model for General Robot Control. Kevin Black, Noah Brown, Danny Driess, Adnan Esmail, Michael Equi, Chelsea Finn, Niccolo Fusai, Lachy Groom, Karol Hausman, Brian Ichter, Szymon Jakubczak, Tim Jones, Liyiming Ke, Sergey Levine, Adrian Li-Bell, Mohith Mothukuri, Suraj Nair, Karl Pertsch, Lucy Xiaoyang Shi, James Tanner, Quan Vuong, Anna Walling, Haohuan Wang, Ury Zhilinsky (2024) [Presenter: Emre Adabag]

World Model

  • Video Language Planning. Yilun Du, Mengjiao Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Kaelbling, Andy Zeng, Jonathan Tompson (2023) [Presenter: Hao Zou]
  • Cosmos World Foundation Model Platform for Physical AI. NVIDIA: Niket Agarwal, Arslan Ali, Maciej Bala, Yogesh Balaji, Erik Barker, Tiffany Cai, Prithvijit Chattopadhyay, Yongxin Chen, Yin Cui, Yifan Ding, Daniel Dworakowski, Jiaojiao Fan, Michele Fenzi, Francesco Ferroni, Sanja Fidler, Dieter Fox, Songwei Ge, Yunhao Ge, Jinwei Gu, Siddharth Gururani, Ethan He, Jiahui Huang, Jacob Huffman, Pooya Jannaty, Jingyi Jin, Seung Wook Kim, Gergely Klár, Grace Lam, Shiyi Lan, Laura Leal-Taixe, Anqi Li, Zhaoshuo Li, Chen-Hsuan Lin, Tsung-Yi Lin, Huan Ling, Ming-Yu Liu, Xian Liu, Alice Luo, Qianli Ma, Hanzi Mao, Kaichun Mo, Arsalan Mousavian, Seungjun Nah, Sriharsha Niverty, David Page, Despoina Paschalidou, Zeeshan Patel, Lindsey Pavao, Morteza Ramezanali, Fitsum Reda, Xiaowei Ren, Vasanth Rao Naik Sabavat, Ed Schmerling, Stella Shi, Bartosz Stefaniak, Shitao Tang, Lyne Tchapmi, Przemek Tredak, Wei-Cheng Tseng, Jibin Varghese, Hao Wang, Haoxiang Wang, Heng Wang, Ting-Chun Wang, Fangyin Wei, Xinyue Wei, Jay Zhangjie Wu, Jiashu Xu, Wei Yang, Lin Yen-Chen, Xiaohui Zeng, Yu Zeng, Jing Zhang, Qinsheng Zhang, Yuxuan Zhang, Qingqing Zhao, Artur Zolkowski (2025) [Presenter: Kuo Gong]
Tom Zollo
John Wendlandt
Emre Adabag
Hao Zou
Kuo Gong
Week 13
Fri, April 18

Locomotion

Mobile Robots

Boshra Khalili
Linlin Zhang
Seojin Yoon
Zirun Wang
Lei Huang
Week 14
Fri, April 25
No Class - Attending Conference
Week 15
Fri, May 2
Spotlight Final Project Spotlights I
Aakash Aanegola, Danelle Tuchman, Sammy Agrawal
Alexander Du, Emre Adabag, Ishaan Mahajan
Feiyang Chen, Kuo Gong, Jifeng Li
Hao Zou, Linlin Zhang, Bingyao Du
Frank Anthony Fan, Rodolfo Costa Raimundo
John Wendlandt, Carl Gross
Dan Harvey
Nikolaus Holzer, Hongyu Li
Lei Huang
Boshra Khalili
Maksym Bondarenko, Charan Santhirasegaran
Week 16
Fri, May 9
Spotlight Final Project Spotlights II
Yutao Mao, Lingyi Zhang
Adithi Narayan, Priyanka Rose Varghese
Seyoung Ree, Seojin Yoon, Karla Nicole Zúñiga
Kheri Hughes, Nico Bykhovsky
Shuo Sha
Charles Xu, Naian Tao
Zirun Wang
Jason Zou, Tianjun Zhong
Yolanda Zhu
Thomas Zollo
William Wang, Philippe Wu