generalization via modularity
play

Generalization via Modularity Deepak Chris Trevor Phillip - PowerPoint PPT Presentation

Learning to Control Self-Assembling Morphologies Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell Isola Efros * equal contribution How do we train a robot? Multiple tasks Expert


  1. Learning to Control Self-Assembling Morphologies Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell Isola Efros * equal contribution

  2. How do we train a robot?

  3. Multiple tasks   Expert demonstrations Rewards, labels  … 

  4. Self-supervision Multiple tasks    Curious exploration  Expert demonstrations Learning “common sense” Rewards, labels   … …  

  5. . . .

  6. . . . … even earlier?

  7. Single to Multicellular

  8. Single to Multicellular competition  collaboration

  9. Single to Multicellular competition  collaboration shared objective

  10. Compositionality has been useful in language … [Andreas et. al. 2016]

  11. How to implement compositionality in hardware?

  12. Modular Co-evolution of Control and Morphology

  13. Modular Co-evolution of Control and Morphology Cylindrical Limb

  14. Modular Co-evolution of Control and Morphology Cylindrical Limb Configurable Motor Joint

  15. Modular Co-evolution of Control and Morphology

  16. Modular Co-evolution of Control and Morphology

  17. Modular Co-evolution of Control and Morphology Potential Magnetic Joint

  18. Modular Co-evolution of Control and Morphology Potential Magnetic Joint

  19. Modular Co-evolution of Control and Morphology Acts as single agent upon joining Rewards are shared! Potential Magnetic Joint

  20. Modular Co-evolution of Control and Morphology Acts as single agent upon joining Rewards are shared!  Input = Local Sensory State  Output = Torques, Link, Unlink Potential Magnetic Joint

  21. Modular Co-evolution of Control and Morphology Acts as single agent upon joining Rewards are shared!  Input = Local Sensory State  Output = Torques, Link, Unlink Potential Magnetic Joint

  22. Consider the task of “standing up” …

  23. How to learn compositional controllers?

  24. Idea: Shared policy network across limbs Node Node Node Node Nod Node Node in Node Node Node Node Node Node

  25. Idea: Shared policy network across limbs output Node Node shared Node Node Nod Node Node policy in 𝜌 𝜄 Node Node Node Node Node Node input

  26. How to adapt when morphology changes?

  27. How to adapt when morphology changes?

  28. Network as reusable LEGO Blocks

  29. Network as reusable LEGO Blocks output shared policy 𝜌 𝜄 input

  30. Network as reusable LEGO Blocks message output output shared policy 𝜌 𝜄 input message input

  31. Network as reusable LEGO Blocks message output output shared same policy dimension 𝜌 𝜄 input message input

  32. Network as reusable LEGO Blocks message output output shared policy 𝜌 𝜄 input message input

  33. Network as reusable LEGO Blocks message output output shared policy 𝜌 𝜄 input message input

  34. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy 𝜌 𝜄 input message input

  35. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy 𝜌 𝜄 input message input

  36. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy 𝜌 𝜄 input message input

  37. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy cut 𝜌 𝜄 input message input

  38. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy cut and paste 𝜌 𝜄 𝜌 𝜄 input message input 𝜌 𝜄 𝜌 𝜄

  39. Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared adaptation by policy cut and paste conditioning 𝜌 𝜄 𝜌 𝜄 input message input 𝜌 𝜄 𝜌 𝜄

  40. Dynamic Graph Networks

  41. BTW, basically curriculum learning but in hardware

  42. How well does it generalize?

  43. . . .

  44. . . . a bit crazy… and totally useless!

  45. Self-Assembling Robots in the Real World [Mark Yim’s Lab at UPenn] [Daniela Rus's Lab at MIT] Also: [Modular Snake Robot – Howie Choset’s Lab at CMU]

  46. code & data at https://people.eecs.berkeley.edu/~pathak/ Poster # 197 …today!! (Multi-agent RL) Thank You!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend