Showing posts from September, 2022

Spelling correction using TextBlob in Python

What is TextBlob?

TextBlob is a Python library for processing textual data. It is built on top of the NLTK module.

How to install TextBlob?

    1. Using pip:
        pip install textblob
    2. Using conda:
        conda install -c conda-forge textblob

Some terms that will be used frequently:

· Corpus – A body of text (singular).
· Lexicon – Words and their meanings.
· Token – Each “entity” produced when text is split up according to a set of rules. For example, each word is a token when a sentence is “tokenized” into words. Each sentence can also be a token, if you tokenize the sentences out of a paragraph.

TextBlob(text, tokenizer=None, np_extractor=None, pos_tagger=None, analyzer=None, classifier=None): A general text block, meant for larger bodies of text.

    Parameters:
    text: string
    tokenizer: (optional) A tokenizer instance. If None, defaults to WordTokenizer().
    np_extractor: (optional) An NPExtractor instance. If None, defaults to FastNPExtractor().
    pos_tagger: