@kaisugi on Hugging Face: "🚀 Stockmark-100b Stockmark Inc. has developed and released one of Japan's…"

kaisugi

posted an update May 18, 2024

🚀 Stockmark-100b

Stockmark Inc. has developed and released "Stockmark-LLM-100b", a 100-billion-parameter large language model (LLM) and one of the largest commercial-scale models in Japan. According to the announcement, the model significantly reduces hallucinations and answers complex business-related queries accurately. It was trained from scratch with a focus on Japanese business data and is intended to be reliable in high-stakes business environments. It's open-source and available for commercial use.

Key highlights:
- The model reduces hallucinations, i.e., confident but incorrect responses that AI models sometimes generate.
- Stockmark-LLM-100b can answer basic business questions and specialized queries in industries like manufacturing.
- According to Stockmark, the model surpasses GPT-4-turbo in accuracy on business-specific queries.
- It scores highly on the VicunaQA evaluation benchmark.
- Inference is fast: it generates 100 characters of Japanese text in 1.86 seconds.
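
To put the speed figure in perspective, here is a small sketch that converts it into characters per second. The helper name `chars_per_second` is made up for illustration, and the extrapolation to longer text assumes generation time scales linearly with length, which the press release does not claim.

```python
def chars_per_second(num_chars: int, seconds: float) -> float:
    """Average generation throughput in characters per second."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return num_chars / seconds


# Figures quoted in the post: 100 Japanese characters in 1.86 seconds.
rate = chars_per_second(100, 1.86)
print(f"{rate:.1f} chars/s")  # 53.8 chars/s

# Rough extrapolation (assumption: linear scaling with output length).
estimated_seconds_for_1000 = 1000 / rate
print(f"{estimated_seconds_for_1000:.1f} s for 1000 characters")  # 18.6 s
```

That works out to roughly 54 characters per second under the quoted conditions; actual speed will depend on hardware and serving setup.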

stockmark/stockmark-100b
stockmark/stockmark-100b-instruct-v0.1

Detailed press release (in Japanese): https://stockmark.co.jp/news/20240516

Priyankvadaliya

May 18, 2024

Stockmark's unveiling of "Stockmark-LLM-100b" is a remarkable advance for language models, particularly in business applications. You mentioned that it reduces AI hallucinations, but how does it address potential biases in its training data, especially given its very large parameter count? If, for instance, we were to deploy Stockmark-LLM-100b in financial decision-making, how would you mitigate the risk of biased outputs influencing critical business strategies?

kaisugi

May 18, 2024

That's a good point.
I'm not an employee of this company, nor do I work in the financial sector, but I do know that the people involved have been actively discussing which cases LLMs should be used in. My guess is that LLMs won't replace human decision-making, but rather augment it.

osanseviero

May 19, 2024

This is impressive, great work!
