All Projects → PacktPublishing → Natural-Language-Processing-Python-and-NLTK

PacktPublishing / Natural-Language-Processing-Python-and-NLTK

Licence: MIT license
Natural Language Processing Python and NLTK by Packt

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Natural Language Processing Python and NLTK

Code repository for Natural Language Processing Python and NLTK

##What You Will Learn:

  • Get a glimpse of the complexity of natural languages and how they are processed by machines
  • Clean and wrangle text using tokenization and chunking to help you better process data
  • Tokenize text into sentences, and sentences into words
  • Classify text and perform sentiment analysis
  • Implement string matching algorithms and normalization techniques
  • Understand and implement the concepts of information retrieval and text summarization
  • Find out how to implement various NLP tasks in Python

Software and Hardware (Module 1):

Chapter number Software required (with version) Download links to the software Hardware specifications OS required
1-5 Python/Anaconda NLTK https://www.python.org/, http://continuum.io/downloads, http://www.nltk.org/ Common Unix Printing System any
6 scikit-learn and gensim http://scikit-learn.org/stable/, https://radimrehurek.com/gensim/ Common Unix Printing System any
7 Scrappy http://scrapy.org/ Common Unix Printing System any
8 NumPy, SciPy, pandas, and matplotlib http://www.numpy.org/, http://www.scipy.org/, http://pandas.pydata.org/, http://matplotlib.org/ Common Unix Printing System any
9 Twitter Python APIs and Facebook python APIs https://dev.twitter.com/overview/api/twitter-libraries, https://developers.facebook.com Common Unix Printing System any

Software and Hardware (Module 2):

| Chapter number | Software required (with version) | Free/Proprietary | Download links to the software | | -------------- | -------------- |-------------- |-------------- |-------------- | | 1 | NLTK>=3.0a4, NLTK Data | Free | http://www.nltk.org, http://www.nltk.org/data.html | | 2 | pyenchant>=1.6.5 | Free | http://pythonhosted.org/pyenchant/ | | 3 | lockfile>=0.9.1, MongoDB >= 2.6, pymongo>=2.6.3 | Free | https://pypi.python.org/pypi/lockfile, http://www.mongodb.org/, https://pypi.python.org/pypi/pymongo/ | | 4 | NLTK-Trainer >= 0.9 | Free | https://github.com/japerk/nltk-trainer | | 7 | scikit-learn>=0.14.1 | Free |http://scikit-learn.org/stable/ | | 8 | Redis >= 2.8, redis>=2.8.0 , execnet>=1.1 | Free | http://redis.io/, https://pypi.python.org/pypi/redis/, https://codespeak.net/execnet/ | | 9 | python-dateutil>=2.0, beautifulsoup4>=4.3.2, lxml>=3.2.3, charade>=1.0.3 | Free | http://labix.org/python-dateutil, http://www.crummy.com/software/BeautifulSoup/, http://lxml.de/, https://pypi.python.org/pypi/charade |

Software and Hardware (Module 3):

Chapter number Software required (with version) Hardware Specifications OS required
All chapters Python 2.7 or 3.2+ Install NLTK 3.0 either on 32-bit or 64-bit machine Windows or Mac/Unix

###Note Modules 1, 2 and 3 have code arranged by chapter (for the chapters that have code). Click here if you have any feedback or suggestions.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].