GitHub - gagolews/stringx: Drop-in replacements for base R string functions powered by stringi (original) (raw)
stringx
Replacements for base R string functions powered by stringi
English is the native language for only 5% of the World population. Also, only 17% of us can understand this text. Moreover, the Latin alphabet is the main one for merely 36% of the total. The early computer era, now a very long time ago, was dominated by the US. Due to the proliferation of the internet, smartphones, social media, and other technologies and media, this is no longer the case. This package replaces base R string functions with ones that fully support the Unicode standards related to natural language and date-time processing. Thanks toICU (International Components for Unicode)and stringi, they are fast, reliable, and portable across different platforms.
stringx replaces baseR functions likepaste
, gregexpr
, tolower
, and strptime
with ones that:
- work in the same way on every platform,
- support a wide range of languages and scripts,
- fix some long-standing inconsistencies in base R (which are pointed out in the packagemanual).
stringx is a set of wrappers aroundstringi — a matureR package for fast, consistent, convenient, and portable string/text/natural language processing in any locale that relies onICU – International Components for Unicode.
To learn more about R, check out Marek's open-access (free!) textbookDeep R Programming.
Package Maintainer and Author:Marek Gagolewski
Homepage: https://stringx.gagolewski.com/
CRAN Entry: https://cran.r-project.org/package=stringx
License:stringx
is distributed under the terms of the GNU General Public License, either Version 2 or Version 3; seeLICENSE.
Changelog: seeNEWS.