Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.3k 564

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.1k 126

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 229

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 7.2k 471

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 637 54

Repositories

Showing 10 of 493 repositories
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    allenai/dolma’s past year of commit activity
    Python 1,119 Apache-2.0 126 24 20 Updated Mar 4, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 2,746 Apache-2.0 351 15 12 Updated Mar 4, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 67 Apache-2.0 18 2 17 Updated Mar 4, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    allenai/OLMo’s past year of commit activity
    Python 5,292 Apache-2.0 564 45 54 Updated Mar 4, 2025
  • ai2-scholarqa-lib Public

    Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

    allenai/ai2-scholarqa-lib’s past year of commit activity
    Python 70 Apache-2.0 7 0 1 Updated Mar 4, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 10 Apache-2.0 5 1 2 Updated Mar 4, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 7,244 Apache-2.0 471 37 17 Updated Mar 4, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 29 Apache-2.0 2 7 6 Updated Mar 3, 2025
  • pixmo-docs Public

    Synthetic data generation pipelines for text-rich images.

    allenai/pixmo-docs’s past year of commit activity
    Python 40 Apache-2.0 8 0 0 Updated Mar 1, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    allenai/ai2thor’s past year of commit activity
    C# 1,283 Apache-2.0 229 243 4 Updated Feb 27, 2025