QVAC, the AI research division of Tether Data, recently announced the official release of the Genesis II dataset. Compared with its predecessor, Genesis I, the new version adds 107 billion tokens, bringing the total size of this publicly available synthetic dataset for educational use to 148 billion tokens (which implies that Genesis I accounted for about 41 billion).
What does this expansion mean? Above all, broader coverage. The new dataset spans 19 different fields, which should help in training more general and more accurate AI models. The jump from Genesis I to Genesis II is more than an increase in volume; it also reflects Tether Data's continued investment in AI research.
As AI and Web3 become more closely integrated, high-quality public datasets like this one help advance the entire ecosystem. Greater scale and richer category coverage let developers and researchers train and validate models on more complete data. In that sense, this is another step by Tether Data toward opening resources to the industry and democratizing AI.