Country: China Type: large-model
Tag: QLoRA
Website: https://github.com/lyogavin/Anima
The first open-source 33B Chinese language model based on QLoRA.
The AI community has always been very open, and AI's progress today is inseparable from many important open-source works, openly shared papers, and open data and code. We believe the future of AI will also be open, and we hope to make some contributions to the open-source community.
Why is the 33B model important? Is QLoRA a Game Changer?
Previously, most of the open-source models that could be finetuned were relatively small 7B or 13B models. Although they can perform well on some simple chatbot evaluation sets after finetuning, their limited scale means their core LLM reasoning ability remains weak, which is why many of these small models behave like toys in practical application scenarios. As discussed in this work, chatbot evaluation sets are relatively simple; on the complex logical reasoning and mathematical problems that truly test a model's ability, there is still a clear gap between small and large models.
Therefore, we believe that the work on QLoRA is very important, to the point where it could be a game changer. Through QLoRA's optimization method, a 33B-scale model can for the first time be finetuned at a democratized, low cost and put to wide use. We believe a 33B model can not only leverage the strong reasoning ability of large-scale models, but also be flexibly finetuned on private business-domain data to strengthen control over the LLM.
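To make the idea concrete, here is a minimal configuration sketch of QLoRA-style finetuning setup using the Hugging Face transformers, peft, and bitsandbytes libraries: the base model is loaded in 4-bit NF4 quantization with double quantization, then wrapped with low-rank adapters so only the small adapter weights are trained. The model path and the hyperparameter values below are illustrative placeholders, not the project's actual settings.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization with double quantization, as in the QLoRA paper.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Placeholder path; substitute the actual 33B base model checkpoint.
model = AutoModelForCausalLM.from_pretrained(
    "path/to/33b-base-model",
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Low-rank adapters are the only trainable parameters;
# the values here are illustrative, not the project's settings.
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a tiny fraction of weights train
```

Because the frozen base weights sit in 4-bit precision and gradients flow only through the adapters, a 33B model's finetuning footprint drops enough to fit on a single high-memory GPU, which is what makes this scale "democratized".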