Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs Paper • 2406.20086 • Published 10 days ago
Multi-property Steering of Large Language Models with Dynamic Activation Composition Paper • 2406.17563 • Published 14 days ago
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published 20 days ago