The training datasets used for training the ChEmbed family of text embedding models
AI & ML interests
None defined yet.
Recent Activity
Organization Card
Edit this README.md markdown file to author your organization card.
models 7
BASF-AI/ChEmbed-full
Feature Extraction • 0.1B • Updated • 1.09k • 2
BASF-AI/ChEmbed-plug
Feature Extraction • 0.1B • Updated • 897
BASF-AI/ChEmbed-prog
Feature Extraction • 0.1B • Updated • 928
BASF-AI/ChEmbed-vanilla
Feature Extraction • 0.1B • Updated • 903
BASF-AI/ChemVocab
Updated
BASF-AI/nomic-bert-2048
0.1B • Updated • 1
BASF-AI/nomic-embed-text-v1.5
Sentence Similarity • 0.1B • Updated • 103
datasets 76
BASF-AI/ChemRxivRetrieval
Viewer • Updated • 79.5k • 96 • 1
BASF-AI/uspto-title-abs-chem
Viewer • Updated • 75.8k • 11
BASF-AI/uspto-synth-query-abs-chem
Viewer • Updated • 75.8k • 11
BASF-AI/PlantCAD2_virtual_hackathon
Viewer • Updated • 9 • 16
BASF-AI/dolma-pes2o-chemistry
Viewer • Updated • 361k • 27 • 1
BASF-AI/ChemRxiv-Papers
Viewer • Updated • 30.4k • 309 • 2
BASF-AI/ChemRxiv-Paragraphs
Viewer • Updated • 209k • 26 • 2
BASF-AI/ChemRxiv-Train-CC-BY
Viewer • Updated • 139k • 19 • 1
BASF-AI/dolma-chem-only-query-generated
Viewer • Updated • 1.17M • 26
BASF-AI/ChemRxiv-Train-CC-BY-v2
Viewer • Updated • 138k • 12 • 2