SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 22 days ago • 17
Distributionally Robust Optimization with Bias and Variance Reduction Paper • 2310.13863 • Published Oct 21, 2023
The Benefits of Balance: From Information Projections to Variance Reduction Paper • 2408.15065 • Published Aug 27, 2024 • 1
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement Paper • 2504.07934 • Published Apr 10 • 20
SuperBPE Collection SuperBPE tokenizers and models trained with them • 9 items • Updated 22 days ago • 17
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Paper • 2503.20672 • Published Mar 26 • 14
DataComp: In search of the next generation of multimodal datasets Paper • 2304.14108 • Published Apr 27, 2023 • 2
Scalable Extraction of Training Data from (Production) Language Models Paper • 2311.17035 • Published Nov 28, 2023 • 3
Git Re-Basin: Merging Models modulo Permutation Symmetries Paper • 2209.04836 • Published Sep 11, 2022 • 2
PLeaS -- Merging Models with Permutations and Least Squares Paper • 2407.02447 • Published Jul 2, 2024