PyoSignal Logo
PyoSignal
Back to Research

UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

Paper ID: 2604.19734 โ€ข 27 Upvotes
Robotics Humanoid Transfer Learning World Modeling RAG Vision Video Benchmark Distillation Safety
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

๐Ÿ“ ํ•ต์‹ฌ ์š”์•ฝ

UniT๋Š” ์ธ๊ฐ„์˜ ํ–‰๋™ ๋ฐ์ดํ„ฐ๋ฅผ ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์— ํšจ๊ณผ์ ์œผ๋กœ ์ „๋‹ฌํ•˜์—ฌ ๋กœ๋ด‡ ํ•™์Šต ๋ฐ์ดํ„ฐ ๋ถ€์กฑ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ , ์‹ค์ œ ๋กœ๋ด‡ ์ œ์–ด ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐ ๊ธฐ์—ฌํ•œ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋‚ด์šฉ

ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ๊ธฐ์ดˆ ๋ชจ๋ธ ํ•™์Šต์€ ๋กœ๋ด‡ ๋ฐ์ดํ„ฐ ๋ถ€์กฑ์œผ๋กœ ์–ด๋ ค์›€์„ ๊ฒช๊ณ  ์žˆ๋‹ค. ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋Œ€๊ทœ๋ชจ ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์žˆ์ง€๋งŒ, ์ธ๊ฐ„๊ณผ ๋กœ๋ด‡์˜ ์‹ ์ฒด ๊ตฌ์กฐ ์ฐจ์ด๋กœ ์ธํ•ด ์ง์ ‘์ ์ธ ์ ์šฉ์ด ์–ด๋ ต๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์‹œ๊ฐ์  ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ธ๊ฐ„๊ณผ ๋กœ๋ด‡์˜ ํ–‰๋™์„ ์—ฐ๊ฒฐํ•˜๋Š” UniT ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. UniT๋Š” ํ–‰๋™๊ณผ ์‹œ๊ฐ ์ •๋ณด ๊ฐ„์˜ ์ƒํ˜ธ ์žฌ๊ตฌ์„ฑ์„ ํ†ตํ•ด ์‹ ์ฒด ๊ตฌ์กฐ์— ๋…๋ฆฝ์ ์ธ ํ–‰๋™ ํ‘œํ˜„์„ ํ•™์Šตํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ์ธ๊ฐ„ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•œ ๋กœ๋ด‡ ์ •์ฑ… ํ•™์Šต ๋ฐ ์„ธ๊ณ„ ๋ชจ๋ธ๋ง์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค. ์‹คํ—˜ ๊ฒฐ๊ณผ, UniT๋Š” ํœด๋จธ๋…ธ์ด๋“œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋ฐ ์‹ค์ œ ๋กœ๋ด‡ ํ™˜๊ฒฝ์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์˜€์œผ๋ฉฐ, ์ธ๊ฐ„์˜ ํ–‰๋™์„ ๋กœ๋ด‡ ์ œ์–ด์— ํšจ๊ณผ์ ์œผ๋กœ ์ด์ „ํ•  ์ˆ˜ ์žˆ์Œ์„ ์ž…์ฆํ–ˆ๋‹ค.

๐Ÿ”‘ ์ฃผ์š” ๋‚ด์šฉ (Key Points)

  • ์ธ๊ฐ„ ํ–‰๋™ ๋ฐ์ดํ„ฐ์™€ ๋กœ๋ด‡ ํ–‰๋™ ๋ฐ์ดํ„ฐ ๊ฐ„์˜ ๊ฐ„๊ทน์„ ํ•ด์†Œํ•˜๋Š” ์ƒˆ๋กœ์šด ํ”„๋ ˆ์ž„์›Œํฌ UniT ์ œ์‹œ
  • ์‹œ๊ฐ์  ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์‹ ์ฒด ๊ตฌ์กฐ์— ๋…๋ฆฝ์ ์ธ ํ–‰๋™ ํ‘œํ˜„ ํ•™์Šต
  • ์ •์ฑ… ํ•™์Šต ๋ฐ ์„ธ๊ณ„ ๋ชจ๋ธ๋ง์„ ํ†ตํ•ด ํœด๋จธ๋…ธ์ด๋“œ ๋กœ๋ด‡์˜ ์ œ์–ด ์„ฑ๋Šฅ ํ–ฅ์ƒ

๐Ÿ’ก ์‹ค๋ฌด์  ๊ฐ€์น˜ (Relevance)

๋กœ๋ด‡ ์ œ์–ด ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ ์‹œ, ์ธ๊ฐ„์˜ ํ–‰๋™ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๋กœ๋ด‡์˜ ํ•™์Šต ํšจ์œจ์„ฑ์„ ๋†’์ด๊ณ , ์ƒˆ๋กœ์šด ๋™์ž‘์„ ๋น ๋ฅด๊ฒŒ ํ•™์Šต์‹œํ‚ค๋Š” ๋ฐ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

โœ… ์ถ”์ฒœ ์•ก์…˜ (Actionable Items)

  • UniT ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ž์ฒด ๋กœ๋ด‡ ํ”Œ๋žซํผ์— ์ธ๊ฐ„ ํ–‰๋™ ๋ฐ์ดํ„ฐ ์ ์šฉ ์‹คํ—˜
  • UniT๋ฅผ ํ™œ์šฉํ•œ ๋กœ๋ด‡ ์ •์ฑ… ํ•™์Šต ๋ฐ ์„ธ๊ณ„ ๋ชจ๋ธ๋ง ์„ฑ๋Šฅ ๋น„๊ต ๋ถ„์„
  • ๋‹ค์–‘ํ•œ ์‹œ๊ฐ์  ํŠน์ง• ์ถ”์ถœ ๋ฐฉ๋ฒ•์„ UniT์— ์ ์šฉํ•˜์—ฌ ์„ฑ๋Šฅ ํ–ฅ์ƒ ๊ฐ€๋Šฅ์„ฑ ํƒ์ƒ‰