Text Analytics with Python: A Practical Real-World Approach to Gaining Actionable Insights from your Data

  • 6h 39m
  • Dipanjan Sarkar
  • Apress
  • 2016

Derive useful insights from your data using Python. You will learn both basic and advanced concepts, including text and language syntax, structure, and semantics. You will focus on algorithms and techniques, such as text classification, clustering, topic modeling, and text summarization.

Text Analytics with Python teaches you the techniques related to natural language processing and text analytics, and you will gain the skills to know which technique is best suited to solve a particular problem. You will look at each technique and algorithm with both a bird's eye view to understand how it can be used as well as with a microscopic view to understand the mathematical concepts and to implement them to solve your own problems.

What You Will Learn:

  • Understand the major concepts and techniques of natural language processing (NLP) and text analytics, including syntax and structure
  • Build a text classification system to categorize news articles, analyze app or game reviews using topic modeling and text summarization, and cluster popular movie synopses and analyze the sentiment of movie reviews
  • Implement Python and popular open source libraries in NLP and text analytics, such as the natural language toolkit (nltk), gensim, scikit-learn, spaCy and Pattern

Who This Book Is For :

IT professionals, analysts, developers, linguistic experts, data scientists, and anyone with a keen interest in linguistics, analytics, and generating insights from textual data

About the Author

Dipanjan Sarkar is a Data Scientist at Intel, the world's largest silicon company which is on a mission to make the world more connected and productive. He primarily works on analytics, business intelligence, application development and building large scale intelligent systems. He received his master's degree in Information Technology from the International Institute of Information Technology, Bangalore with a focus on data science and software engineering. He has been an analytics practitioner for over 4 years now specializing in statistical, predictive and text analytics. He has also authored a book on Machine Learning with R and occasionally reviews technical books.

In this Book

  • Natural Language Basics
  • Python Refresher
  • Processing and Understanding Text
  • Text Classification
  • Text Summarization
  • Text Similarity and Clustering
  • Semantic and Sentiment Analysis