NeoByBy's picture

6 3

NeoByBy

NeoByBy

·

AI & ML interests

None yet

Organizations

NeoByBy's activity

upvoted a paper 14 days ago

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published 15 days ago • 53

upvoted a paper 22 days ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published about 1 month ago • 54

upvoted 2 collections 2 months ago

multimodal

93 items • Updated about 19 hours ago • 3

VisionLM

333 items • Updated 1 day ago • 23

upvoted 2 papers 2 months ago

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13 • 36

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27 • 12