University of Washington

Verified

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

alisawuffles updated a collection about 1 month ago

alisawuffles updated a collection about 1 month ago

weikaih authored a paper about 2 months ago

Task Me Anything

View all activity

Papers

Proactive Hearing Assistants that Isolate Egocentric Conversations

View all Papers

alisawuffles

updated a collection about 1 month ago

SuperBPE

SuperBPE tokenizers and models trained with them • 9 items • Updated 22 days ago • 17

LeonLeng

updated a Space 2 months ago

README

LeonLeng

published a Space 2 months ago

README

ronakdm

authored 3 papers 4 months ago

Distributionally Robust Optimization with Bias and Variance Reduction

Paper • 2310.13863 • Published Oct 21, 2023

The Benefits of Balance: From Information Projections to Variance Reduction

Paper • 2408.15065 • Published Aug 27, 2024 • 1

A Generalization Theory for Zero-Shot Prediction

Paper • 2507.09128 • Published Jul 12

alisawuffles

in UW/OLMo2-8B-SuperBPE-t180k 8 months ago

Training code for Tokenizer

#1 opened 8 months ago by

kevinlin311tw

authored a paper 8 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10 • 20

alisawuffles

updated a dataset 8 months ago

UW/olmo-mix-1124-subset-p99

Updated Apr 10 • 275 • 2

alisawuffles

updated a collection 8 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 9 items • Updated 22 days ago • 17

kevinlin311tw

authored a paper 9 months ago

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published Mar 26 • 14

Jhayase

authored 7 papers 9 months ago

DataComp: In search of the next generation of multimodal datasets

Paper • 2304.14108 • Published Apr 27, 2023 • 2

Scalable Extraction of Training Data from (Production) Language Models

Paper • 2311.17035 • Published Nov 28, 2023 • 3

Query-Based Adversarial Prompt Generation

Paper • 2402.12329 • Published Feb 19, 2024

Git Re-Basin: Merging Models modulo Permutation Symmetries

Paper • 2209.04836 • Published Sep 11, 2022 • 2

Scalable Fingerprinting of Large Language Models

Paper • 2502.07760 • Published Feb 11

PLeaS -- Merging Models with Permutations and Least Squares

Paper • 2407.02447 • Published Jul 2, 2024

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17 • 13

Jhayase

published a model 9 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 9 • 2