2024-12-27 14:19
Code implementations for several papers:
"Long-Form Speech Generation with Spoken Language Models" (2024) GitHub: github.com/google-deepmind/librispeech-long
"DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs" (2024) GitHub: github.com/MengLcool/SliMM
"DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT" (2024) GitHub: github.com/YvanYin/DrivingWorld
[fig1]
"WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents" (2024) GitHub: github.c
………………………………