malteos's picture

malteos

malteos

commoncrawl

·

https://ostendorff.org

malteos

AI & ML interests

None yet

Recent Activity

updated a Space 2 days ago

commoncrawl/cc-citations

updated a Space 9 days ago

malteos/some-tests

published a Space 10 days ago

malteos/some-tests

View all activity

Organizations

authored 10 papers 11 months ago

Tokenizer Choice For LLM Training: Negligible or Crucial?

Paper • 2310.08754 • Published Oct 12, 2023 • 3

Towards an Open Platform for Legal Information

Paper • 2005.13342 • Published May 27, 2020

Aspect-based Document Similarity for Research Papers

Paper • 2010.06395 • Published Oct 13, 2020

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

Paper • 2202.06671 • Published Feb 14, 2022 • 2

Specialized Document Embeddings for Aspect-based Similarity of Research Papers

Paper • 2203.14541 • Published Mar 28, 2022

Investigating Gender Bias in Turkish Language Models

Paper • 2404.11726 • Published Apr 17, 2024 • 1

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning

Paper • 2301.09626 • Published Jan 23, 2023 • 2

Progress Report: Towards European LLMs

Paper • 2410.03730 • Published Sep 30, 2024 • 3

Data Processing for the OpenGPT-X Model Family

Paper • 2410.08800 • Published Oct 11, 2024 • 1

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 43