According to Svmuu, DeepSeek has released a preview version of its V4 series open-source models under the MIT license, with model weights now available on Hugging Face and ModelScope.
This series includes two MoE models: the V4-Pro features approximately 1.6 trillion total parameters with 49 billion activated per token, while the V4-Flash has 284 billion total parameters and 13 billion activated per token. Both support a 1 million token context window. The company stated that, compared to the V3.2 version, memory usage and computational costs for long-text inference are significantly reduced.
Disclaimer:All content on this platform is sourced from the internet and is provided for informational purposes only. None of the content represents the views of this site, nor does it constitute investment advice. Please exercise caution when investing.
DeepSeek open-sources V4 model with 1.6 trillion parameters
Recommended Reading




