-
Notifications
You must be signed in to change notification settings - Fork 26
/
DESCRIPTION
72 lines (72 loc) · 2.21 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
Package: textclean
Title: Text Cleaning Tools
Version: 0.9.7
Authors@R: c(
person("Tyler", "Rinker", email = "[email protected]", role = c("aut", "cre")),
person("ctwheels", "StackOverflow", role = "ctb"),
person("Surin", "Space", role = "ctb")
)
Maintainer: Tyler Rinker <[email protected]>
Description: Tools to clean and process text. Tools are geared at checking for substrings that
are not optimal for analysis and replacing or removing them (normalizing) with more
analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards
(2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For
example, emoticons are often used in text but not always easily handled by analysis
algorithms. The replace_emoticon() function replaces emoticons with word
equivalents.
Depends: R (>= 3.4.0)
Imports: data.table, english(>= 1.0-2), glue (>= 1.3.0), lexicon (>= 1.0.0), mgsub (>= 1.5.0), qdapRegex,
stringi, textshape(>= 1.0.1), utils
Suggests: hunspell, testthat
License: GPL-2
LazyData: TRUE
RoxygenNote: 7.1.2
Encoding: UTF-8
URL: https://github.com/trinker/textclean
BugReports: https://github.com/trinker/textclean/issues
Collate:
'add_comma_space.R'
'add_missing_endmark.R'
'utils.R'
'replace_html.R'
'check_text_logicals.R'
'check_text.R'
'drop_element.R'
'drop_row.R'
'fgsub.R'
'fix_mdyyyy.R'
'glue-reexports.R'
'has_endmark.R'
'like.R'
'make_plural.R'
'match_tokens.R'
'mgsub.R'
'replace_contraction.R'
'replace_date.R'
'replace_email.R'
'replace_emoji.R'
'replace_emoticon.R'
'replace_grade.R'
'replace_hash.R'
'replace_incomplete.R'
'replace_internet_slang.R'
'replace_kerning.R'
'replace_misspelling.R'
'replace_money.R'
'replace_names.R'
'replace_non_ascii.R'
'replace_number.R'
'replace_ordinal.R'
'replace_rating.R'
'replace_symbol.R'
'replace_tag.R'
'replace_time.R'
'replace_to.R'
'replace_tokens.R'
'replace_url.R'
'replace_white.R'
'replace_word_elongation.R'
'strip.R'
'sub_holder.R'
'swap.R'
'textclean-package.R'