Kazan Federal University Digital Repository

Natural Language Processing with Python Quick Start Guide: Going from a Python Developer to an Effective Natural Language Processing Engineer.

Show simple item record

dc.contributor.author Kasliwal Nirant.
dc.date.accessioned 2024-01-26T21:36:44Z
dc.date.available 2024-01-26T21:36:44Z
dc.date.issued 2018
dc.identifier.citation Kasliwal. Natural Language Processing with Python Quick Start Guide: Going from a Python Developer to an Effective Natural Language Processing Engineer. - Birmingham: Packt Publishing Ltd, 2018 - 1 online resource (177 pages) - URL: https://libweb.kpfu.ru/ebsco/pdf/1950559.pdf
dc.identifier.isbn 1788994108
dc.identifier.isbn 9781788994101
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/178473
dc.description Weighted classifiers
dc.description.abstract NLP in Python is among the most sought-after skills among data scientists. With code and relevant case studies, this book will show how you can use industry grade tools to implement NLP programs capable of learning from relevant data. We will explore many modern methods ranging from spaCy to word vectors that have reinvented NLP.
dc.description.tableofcontents Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Getting Started with Text Classification; What is NLP?; Why learn about NLP?; You have a problem in mind; Technical achievement; Do something new; Is this book for you?; NLP workflow template; Understanding the problem; Understanding and preparing the data; Quick wins -- proof of concept; Iterating and improving; Algorithms; Pre-processing; Evaluation and deployment; Evaluation; Deployment; Example -- text classification workflow; Launchpad -- programming environment setup
dc.description.tableofcontents Text classification in 30 lines of codeGetting the data; Text to numbers; Machine learning; Summary; Chapter 2: Tidying your Text; Bread and butter -- most common tasks; Loading the data; Exploring the loaded data; Tokenization; Intuitive -- split by whitespace; The hack -- splitting by word extraction; Introducing Regexes; spaCy for tokenization; How does the spaCy tokenizer work?; Sentence tokenization; Stop words removal and case change; Stemming and lemmatization; spaCy for lemmatization; -PRON-; Case-insensitive; Conversion -- meeting to meet; spaCy compared with NLTK and CoreNLP
dc.description.tableofcontents Correcting spellingFuzzyWuzzy; Jellyfish; Phonetic word similarity; What is a phonetic encoding?; Runtime complexity; Cleaning a corpus with FlashText; Summary; Chapter 3: Leveraging Linguistics; Linguistics and NLP; Getting started; Introducing textacy; Redacting names with named entity recognition; Entity types; Automatic question generation; Part-of-speech tagging; Creating a ruleset; Question and answer generation using dependency parsing; Visualizing the relationship; Introducing textacy; Leveling up -- question and answer; Putting it together and the end; Summary
dc.description.tableofcontents Chapter 4: Text Representations -- Words to NumbersVectorizing a specific dataset; Word representations; How do we use pre-trained embeddings?; KeyedVectors API; What is missing in both word2vec and GloVe?; How do we handle Out Of Vocabulary words?; Getting the dataset; Training fastText embedddings; Training word2vec embeddings; fastText versus word2vec; Document embedding; Understanding the doc2vec API; Negative sampling; Hierarchical softmax; Data exploration and model evaluation; Summary; Chapter 5: Modern Methods for Classification; Machine learning for text
dc.description.tableofcontents Sentiment analysis as text classification Simple classifiers; Optimizing simple classifiers; Ensemble methods; Getting the data; Reading data; Simple classifiers; Logistic regression; Removing stop words; Increasing ngram range; Multinomial Naive Bayes; Adding TF-IDF; Removing stop words; Changing fit prior to false; Support vector machines; Decision trees; Random forest classifier; Extra trees classifier; Optimizing our classifiers; Parameter tuning using RandomizedSearch; GridSearch; Ensembling models; Voting ensembles -- Simple majority (aka hard voting); Voting ensembles -- soft voting
dc.language English
dc.language.iso en
dc.publisher Birmingham Packt Publishing Ltd
dc.subject.other Natural language processing (Computer science)
dc.subject.other Python (Computer program language)
dc.subject.other Natural Language Processing
dc.subject.other Python (Computer Program Language)
dc.subject.other Computer Software -- Testing.
dc.subject.other Debugging In Computer Science.
dc.subject.other Computers -- Languages -- Python.
dc.subject.other Computers -- Software Development & Engineering -- Quality Assurance & Testing.
dc.subject.other Traitement automatique des langues naturelles.
dc.subject.other Python (Langage de programmation)
dc.subject.other Programming & scripting languages: general.
dc.subject.other Artificial intelligence.
dc.subject.other Natural language & machine translation.
dc.subject.other Computers -- Intelligence (AI) & Semantics.
dc.subject.other Computers -- Programming Languages -- Python.
dc.subject.other Computers -- Natural Language Processing.
dc.subject.other Natural language processing (Computer science)
dc.subject.other Python (Computer program language)
dc.title Natural Language Processing with Python Quick Start Guide: Going from a Python Developer to an Effective Natural Language Processing Engineer.
dc.type Book
dc.description.pages 1 online resource (177 pages)
dc.collection Электронно-библиотечные системы
dc.source.id EN05CEBSCO05C14


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics