Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning Paper • 2508.04581 • Published Aug 6 • 5
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning Paper • 2508.04581 • Published Aug 6 • 5
COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Paper • 2509.22075 • Published Sep 26 • 21
SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection Paper • 2404.06137 • Published Apr 9, 2024
Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts Paper • 2503.15948 • Published Mar 20
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images Paper • 2505.07704 • Published May 12 • 29
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 26
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 26
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5 • 26
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets Paper • 2305.11625 • Published May 19, 2023 • 1
Iterative Self-Training for Code Generation via Reinforced Re-Ranking Paper • 2504.09643 • Published Apr 13 • 34