Cognition - a Ksgk-fy Collection

Ksgk-fy 's Collections

Exciting Papers

Memory

What I don't understand

Cognition

updated about 3 hours ago

Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24 • 38
Note General model is not great at specializing tasks. Narrow-domain fine-tuned checkpoint becomes better at specific tasks, such local improvement can feedback onto the full training dataset, achieving self-augmentation based improvement. This is a interesting idea.
Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 118

Note Use small language model to search the graph and route to the doman expert.
Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26 • 47
Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25 • 3
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions

Paper • 2409.08596 • Published 3 days ago • 1
What Makes a Maze Look Like a Maze?

Paper • 2409.08202 • Published 4 days ago • 1