Data Augmentation with Python by Duc Haba
Author: Duc Haba
Language: eng
Format: epub
Publisher: Packt
Published: 2023-11-15T00:00:00+00:00
Figure 5.12 – Netflix keyboard augmenting
Pluto does the same for the Twitter NLP data, as follows:
# use keyboard augmentation technique
pluto.print_aug_keyboard(pluto.df_twitter_data, col_dest='clean_tweet', aug_name='Keyboard Augment')
The output is as follows:
Figure 5.13 – Twitter keyboard augmenting
The last of the three text augmentation methods is the random technique.
Random augmenting
The random character function randomly inserts, deletes, substitutes, or swaps characters in the text. These are the four modes of the random process, and the action parameter selects which one to apply. The augmentation variable is defined as follows:
# define augmentation function variable definition
aug_func = nlpaug.augmenter.char.RandomCharAug(action=action)
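To make the four modes concrete, here is a minimal standard-library sketch of what each action does to a string. It is an illustration only, not nlpaug's implementation: the function name random_char_augment and the rate and seed parameters are hypothetical, and the selection rule (each letter is picked independently with probability rate) is an assumption made for simplicity.

```python
import random

ACTIONS = ("insert", "delete", "substitute", "swap")


def random_char_augment(text, action="insert", rate=0.1, seed=None):
    """Apply one character-level edit mode to text (hypothetical helper)."""
    if action not in ACTIONS:
        raise ValueError(f"action must be one of {ACTIONS}, got {action!r}")
    rng = random.Random(seed)  # a seed makes the result reproducible
    alphabet = "abcdefghijklmnopqrstuvwxyz"
    chars = list(text)
    out = []
    i = 0
    while i < len(chars):
        ch = chars[i]
        # Assumption: only letters are candidates, each chosen with probability `rate`.
        picked = ch.isalpha() and rng.random() < rate
        if not picked:
            out.append(ch)
        elif action == "insert":
            # Keep the character and add a random letter after it.
            out.extend([ch, rng.choice(alphabet)])
        elif action == "delete":
            # Drop the character entirely.
            pass
        elif action == "substitute":
            # Replace the character with a random letter.
            out.append(rng.choice(alphabet))
        elif action == "swap":
            # Exchange the character with its right-hand neighbor, if any.
            if i + 1 < len(chars):
                out.extend([chars[i + 1], ch])
                i += 1  # the neighbor has already been emitted
            else:
                out.append(ch)
        i += 1
    return "".join(out)


if __name__ == "__main__":
    sample = "A chef travels the world to find rare recipes."
    for name in ACTIONS:
        print(f"{name:>10}: {random_char_augment(sample, action=name, rate=0.2, seed=42)}")
```

Insert lengthens the text, delete shortens it, and substitute and swap preserve its length. Swap also preserves the exact set of characters, which makes it the mildest of the four modes.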
Pluto uses the print_aug_char_random() wrapper function with action set to insert on the Netflix NLP data, as follows:
# use random insert augmentation technique
pluto.print_aug_char_random(pluto.df_netflix_data, action='insert', col_dest='description', aug_name='Random Insert Augment')
The output is as follows: