Ivan

ivandolgov
  • johndolgov

AI & ML interests

NLP, RL

Recent Activity

upvoted an article 2 days ago
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
upvoted a collection 2 days ago
Mellum 2
upvoted a paper 2 days ago
Mellum2 Technical Report

Organizations

JetBrains

upvoted an article 2 days ago
Article

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains • 2 days ago • 21
upvoted a collection 2 days ago

Mellum 2

Collection
Mellum2 model weights • 6 items • Updated 2 days ago • 77
upvoted a paper 2 days ago

Mellum2 Technical Report

Paper • 2605.31268 • Published 6 days ago • 49
liked 6 models 2 days ago

JetBrains/Mellum2-12B-A2.5B-Base-Pretrain

Text Generation • 12B • Updated 2 days ago • 52 • 8

JetBrains/Mellum2-12B-A2.5B-Base

Text Generation • 12B • Updated 2 days ago • 832 • 13

JetBrains/Mellum2-12B-A2.5B-Instruct-SFT

Text Generation • 12B • Updated 2 days ago • 57 • 7

JetBrains/Mellum2-12B-A2.5B-Thinking-SFT

Text Generation • 12B • Updated 2 days ago • 118 • 16

JetBrains/Mellum2-12B-A2.5B-Instruct

Text Generation • 12B • Updated 2 days ago • 644 • 44

JetBrains/Mellum2-12B-A2.5B-Thinking

Text Generation • 12B • Updated 2 days ago • 6.94k • 174