503 212 587

Daniel van Strien

davanstrien

https://danielvanstrien.xyz/

vanstriendaniel

davanstrien

AI & ML interests

Machine Learning Librarian

Articles

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

15 days ago

• 11

Data Is Better Together: A Look Back and Forward

16 days ago

• 14

Synthetic dataset generation techniques: generating custom sentence similarity data

May 23

• 13

Synthetic dataset generation techniques: Self-Instruct

May 15

• 5

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

May 7

• 7

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 36

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 14

Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub

Aug 2, 2023

The Hugging Face Hub for Galleries, Libraries, Archives and Museums

Jun 12, 2023

• 1

Introducing BERTopic Integration with Hugging Face Hub

May 31, 2023

• 2

Jupyter X Hugging Face

Mar 23, 2023

• 2

Image search with 🤗 datasets

Mar 16, 2022

• 5

Organizations

davanstrien's activity

commented a paper 4 days ago

Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER

Paper • 2407.01272 • Published 4 days ago • 6 •

New activity in davanstrien/magpie 4 days ago

Can I use this locally with other model like qwen2 7b etc?(+ this is support load in 8 bit model?)

#1 opened 4 days ago by

Clausss

New activity in mrm8488/magpie_llama-3-8b_spanish 4 days ago

add some more metadata :)

#1 opened 4 days ago by

davanstrien

commented a paper 5 days ago

Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

Paper • 2406.17720 • Published 10 days ago • 7 •

commented a paper 8 days ago

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Paper • 2406.19314 • Published 8 days ago • 12 •

New activity in Dahoas/rm-static 9 days ago

add dpo tag

#4 opened 9 days ago by

davanstrien

New activity in BeastyZ/cmteb_retrieval 10 days ago

add language

#2 opened 10 days ago by

davanstrien

New activity in ellamind/wikipedia-2023-11-reranking-multilingual 10 days ago

add language

#1 opened 10 days ago by

davanstrien

New activity in PKU-Alignment/PKU-SafeRLHF 11 days ago

add link to paper page

#2 opened 11 days ago by

davanstrien

New activity in tinystyler/tinystyler 11 days ago

add citation info

#1 opened 11 days ago by

davanstrien

commented a paper 15 days ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published 16 days ago • 15 •

New activity in sqrti/SPA-VL 17 days ago

Add citation info

#2 opened 17 days ago by

davanstrien

commented a paper 17 days ago

Large Scale Transfer Learning for Tabular Data via Language Modeling

Paper • 2406.12031 • Published 18 days ago • 6 •

commented a paper 18 days ago

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published 19 days ago • 10 •

commented a paper 19 days ago

GEB-1.3B: Open Lightweight Large Language Model

Paper • 2406.09900 • Published 22 days ago • 18 •

New activity in Salesforce/xlam-function-calling-60k 23 days ago

add synthetic tag

#4 opened 23 days ago by

davanstrien

Open sourcing APIGen?

#3 opened 23 days ago by

davanstrien

New activity in apple/DataCompDR-1B 24 days ago

small metadata suggestions

#2 opened 24 days ago by

davanstrien

New activity in tsynbio/ProteinLMBench 25 days ago

add citation info

#2 opened 25 days ago by

davanstrien

New activity in CropNet/CropNet 25 days ago

add link to arxiv paper

#1 opened 25 days ago by

davanstrien

New activity in biunlp/HeSum 26 days ago

add basic info to dataset card

#2 opened 26 days ago by

davanstrien

New activity in librarian-bots/new-datasets-in-machine-learning 26 days ago

Update README.md

#1 opened 26 days ago by

davanstrien

commented a paper about 1 month ago

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Models are Task-specific Classifiers

Paper • 2403.02839 • Published Mar 5 • 1 •

New activity in alvarobartt/replacing-judges-with-juries-distilabel about 1 month ago

add link to blog post :)

#3 opened about 1 month ago by

davanstrien

New activity in DIBT/MPEP_RUSSIAN about 1 month ago

Update README.md

#2 opened about 1 month ago by

davanstrien

Dataset card update checkpoint

#1 opened about 1 month ago by

ZennyKenny

New activity in Nerfgun3/bad_prompt about 1 month ago

move task related tags to task categories

#16 opened about 1 month ago by

davanstrien

commented 5 papers about 2 months ago

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 30 •

Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

Paper • 2307.03692 • Published Jul 5, 2023 • 24 •

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 38 •

Generative AI for Synthetic Data Generation: Methods, Challenges and the Future

Paper • 2403.04190 • Published Mar 7 •

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 107 •

New activity in argilla/Capybara-Preferences about 2 months ago

Update README.md

#1 opened about 2 months ago by

davanstrien

commented 12 papers about 2 months ago

ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models

Paper • 2403.16187 • Published Mar 24 •

CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model

Paper • 2403.08350 • Published Mar 13 •

Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model

Paper • 2404.10306 • Published Apr 16 •

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Paper • 2403.11808 • Published Mar 18 •

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

Paper • 2403.09192 • Published Mar 14 •

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Paper • 2403.14608 • Published Mar 21 •

MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of Experts

Paper • 2404.15159 • Published Apr 22 •

BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models

Paper • 2404.02827 • Published Apr 3 •

Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension

Paper • 2404.17991 • Published Apr 27 •

GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning

Paper • 2404.09163 • Published Apr 14 •

IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

Paper • 2404.16816 • Published Apr 25 • 1 •

Optimizing Language Model's Reasoning Abilities with Weak Supervision

Paper • 2405.04086 • Published May 7 • 1 •

New activity in davanstrien/cosmochat about 2 months ago

Improve third turn

#1 opened about 2 months ago by

davanstrien

New activity in teknium/openhermes 2 months ago

update tag

#5 opened 2 months ago by

davanstrien

New activity in DIBT/aya_dutch_dpo 2 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 2 months ago by

librarian-bot

New activity in ibm/KVP10k 2 months ago

add minimal card template with citation info

#2 opened 2 months ago by

davanstrien

New activity in Harvard-Edge/Wake-Vision 2 months ago

add minimal dataset card with link to paper

#5 opened 2 months ago by

davanstrien

New activity in tomasonjo/synthetic-text2cypher-gpt4turbo 2 months ago

Update README.md

#1 opened 2 months ago by

davanstrien

New activity in avramandrei/histnero 2 months ago

Add link to paper

#2 opened 2 months ago by

davanstrien

New activity in PleIAs/Post-OCR-Correction 2 months ago

tags and typo

#2 opened 2 months ago by

davanstrien

New activity in Eladio/emrqa-msquad 2 months ago

add outline for dataset card

#2 opened 2 months ago by

davanstrien

New activity in argilla/argilla-template-space-with-oauth 3 months ago

Bump Argilla version

#3 opened 3 months ago by

davanstrien

New activity in argilla/demo 3 months ago

Bump Argilla

#2 opened 3 months ago by

davanstrien

New activity in DIBT-Dutch/prompt-translation-for-Dutch 3 months ago

Bump argilla version to 1.27.0

#2 opened 3 months ago by

davanstrien

New activity in 2A2I/prompt-translation-for-Arabic 3 months ago

Upgrade argilla version to 1.27.0

#2 opened 3 months ago by

davanstrien

New activity in mistralai/Mixtral-8x22B-Instruct-v0.1 3 months ago

Add language metadata to model card

#5 opened 3 months ago by

davanstrien

New activity in BramVanroy/orca_dpo_pairs_dutch 3 months ago

Update metadata to add DPO tag and remove deprecated tag

#3 opened 3 months ago by

davanstrien

Daniel van Strien

AI & ML interests

Articles

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

Data Is Better Together: A Look Back and Forward

Synthetic dataset generation techniques: generating custom sentence similarity data

Synthetic dataset generation techniques: Self-Instruct

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Data is better together

Extracting Insights from Model Cards Using Open Large Language Models

Creating open machine learning datasets? Share them on the Hugging Face Hub!

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub

The Hugging Face Hub for Galleries, Libraries, Archives and Museums

Introducing BERTopic Integration with Hugging Face Hub

Jupyter X Hugging Face

Image search with 🤗 datasets

Organizations

davanstrien's activity

Can I use this locally with other model like qwen2 7b etc?(+ this is support load in 8 bit model?)

add some more metadata :)

add dpo tag

add language

add language

add link to paper page

add citation info

Add citation info

add synthetic tag

Open sourcing APIGen?

small metadata suggestions

add citation info

add link to arxiv paper

add basic info to dataset card

Update README.md

add link to blog post :)

Update README.md

Dataset card update checkpoint

move task related tags to task categories

Update README.md

Improve third turn

update tag

Librarian Bot: Add language metadata for dataset

add minimal card template with citation info

add minimal dataset card with link to paper

Update README.md

Add link to paper

tags and typo

add outline for dataset card

Bump Argilla version

Bump Argilla

Bump argilla version to 1.27.0

Upgrade argilla version to 1.27.0

Add language metadata to model card

Update metadata to add DPO tag and remove deprecated tag