25 Best Datasets for NLP

25 Best Datasets for NLP


NLP (General)

  • Enron Dataset
  • Amazon Reviews
  • Google Books Ngrams
  • Blogger Corpus
  • Wikipedia Links Data 
  • Gutenberg eBooks List
  • Jeopardy
  • Hansards Text chunks of Canadian
    Parliament
  • SMS Spam Collection in English



Text Dataset

  • 20 Newsgroups
  • ArXiv
  • Reuters News Dataset
  • The WikiQA Corpus
  • UCl's Spambase
  • Yelp Reviews
  • WordNet
  • The Blog Authorship Corpus



Sentiment Analysis

  • Multidomain Sentiment Analysis Dataset
  • IMDB Reviews
  • Stanford Sentiment Treebank
  • Sentiment140
  • Twitter US Airline Sentiment



Audio Speech Datasets for NLP

  • 2000 HUB5 English
  • LibriSpeech
  • Spoken Wikipedia Corpora
  • Free Spoken Digit Dataset
  • TIMIT

 

If you really like this💯, then follow🌈 me by Clicking Follow💥 button next to comment section.🤩🥰

Stay Connect with me 😃
Thank you 💙😇

Thank you for visiting my blog. My team is here to help you. Let us know if you have any doubts.

Post a Comment (0)
Previous Post Next Post