Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
OctoLong 's Collections
Instruct Checkpoints
Merged Base Checkpoints
Extended Base Checkpoints
Original Base Checkpoints

Original Base Checkpoints

updated about 19 hours ago

Qwen3 checkpoints with modified configurations for long context fine-tuning in the OctoLong project

Upvote
-

  • OctoLong/Qwen3-0.6B-Base

    Text Generation • 0.6B • Updated 1 day ago • 22

  • OctoLong/Qwen3-1.7B-Base

    Text Generation • 2B • Updated 1 day ago • 125

  • OctoLong/Qwen3-4B-Base

    Text Generation • 4B • Updated 1 day ago • 107

  • OctoLong/Qwen3-8B-Base

    Text Generation • 8B • Updated 1 day ago • 113 • 1

  • OctoLong/Qwen3-14B-Base

    Text Generation • 15B • Updated 1 day ago • 83
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs