lee

Lee1990

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Lee1990's activity

reacted to Locutusque's post with 🤗👍❤️ 10 months ago
Post
Introducing the "UltraTextbooks" dataset 🚀📚
Check it out here: Locutusque/UltraTextbooks
📘 A comprehensive collection of high-quality synthetic and human-written textbooks
👨‍🎓 Spanning various subjects and programming languages
🔧 Designed for advanced NLP tasks like language modeling, educational QA, text summarization, and content generation for edu purposes
🚀 Future expansions planned with additional data sources to enhance the corpus
👇 Data composition highlights 👇
- Blend of synthetic and human-written material
- Includes topics from general edu to specialized areas
- Structured with field "text"
🧩 Data collection from various Hugging Face datasets, guided by a diverse and comprehensive curation rationale
🚧 Limitations may exist, so report any issues you encounter
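
Since the post says each record is structured with a "text" field, here is a minimal sketch of how one might peek at a few entries with the Hugging Face `datasets` library. The helper names `iter_texts` and `preview_ultratextbooks` are hypothetical, and the "train" split name is an assumption that the post does not confirm.

```python
from itertools import islice
from typing import Iterable, Iterator, Mapping


def iter_texts(records: Iterable[Mapping[str, object]], field: str = "text") -> Iterator[str]:
    """Yield the non-empty string stored under `field` in each record."""
    for record in records:
        value = record.get(field)
        if isinstance(value, str) and value.strip():
            yield value


def preview_ultratextbooks(n: int = 3) -> list[str]:
    """Return the first 200 characters of the first `n` texts (hypothetical helper)."""
    # Imported lazily so iter_texts works even without the library installed.
    from datasets import load_dataset

    # Assumption: a "train" split exists. Streaming avoids a full download.
    stream = load_dataset("Locutusque/UltraTextbooks", split="train", streaming=True)
    return [text[:200] for text in islice(iter_texts(stream), n)]


if __name__ == "__main__":
    for snippet in preview_ultratextbooks():
        print(snippet)
        print("-" * 40)
```

Streaming is used here only so a quick look does not require fetching the whole corpus; drop `streaming=True` if you want a local copy.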