R Dataset / Package Ecdat / nonEnglishNames

Submitted by pmagunia on March 9, 2018 - 1:06 PM
Dataset License
GNU General Public License v2.0
Attachment
dataset-98983.csv (231 bytes)
Documentation

Names with Character Set Problems

Description

A data.frame of names that contain character codes rare or nonexistent in standard English text, e.g., names with accent marks that may not be coded consistently across locales or by different software.

Usage

data(nonEnglishNames)

Format

A data.frame with two columns:

nonEnglish

a character vector of names that often contain non-standard characters, with the non-standard characters replaced by "_"

English

a character vector containing a standard English-character translation of nonEnglish

See Also

grepNonStandardCharacters, subNonStandardCharacters

Examples

data(nonEnglishNames)
all.equal(ncol(nonEnglishNames), 2)
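
The two columns can serve as a lookup table for repairing names whose accented characters were lost on import. The following is a minimal sketch of that idea, not the package's own method: `lookup`, `raw`, `idx`, and `clean` are hypothetical names, and the rows in `lookup` are illustrative stand-ins (not copied from the dataset) so that the code runs without Ecdat installed.

```r
# Hypothetical stand-in with the same two columns as nonEnglishNames.
# These rows are illustrative only and are not taken from the dataset.
lookup <- data.frame(
  nonEnglish = c("Ra_l", "Jos_", "Nydia Vel_zquez"),
  English    = c("Raul", "Jose", "Nydia Velazquez"),
  stringsAsFactors = FALSE
)

# Names as they might arrive after a lossy import, with "_" standing in
# for the characters that could not be decoded.
raw <- c("Jos_", "Smith", "Ra_l")

# match() gives the row of `lookup` for each name, or NA when absent.
idx <- match(raw, lookup$nonEnglish)

# Keep the original name when the lookup table has no entry for it.
clean <- ifelse(is.na(idx), raw, lookup$English[idx])
print(clean)
```

With Ecdat installed, the stand-in could presumably be swapped for the real table by calling data(nonEnglishNames) and using nonEnglishNames in place of `lookup`. The functions listed under See Also are not used here because their arguments are not described on this page.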

Dataset imported from https://www.r-project.org.

Documentation License
GNU General Public License v2.0
