• 1 Introduction
  • 2 Read entity data
  • 3 Name length
    • 3.1 last_name
    • 3.2 first_name
    • 3.3 midl_name
  • 4 name_sufx_cd
  • 5 Standardisation
    • 5.1 Lower-case letters.
    • 5.2 Non-alphanumeric
      • 5.2.1 Hyphen
      • 5.2.2 Quote
      • 5.2.3 Period
      • 5.2.4 Comma
      • 5.2.5 Other non-alphanumeric
    • 5.3 Digits
      • 5.3.1 Zero
      • 5.3.2 One
      • 5.3.3 Other digits
    • 5.4 Special words
      • 5.4.1 Last name
      • 5.4.2 First name
      • 5.4.3 Middle name
  • Timing

Last updated: 2021-04-04


Knit directory: fa_sim_cal/

This reproducible R Markdown analysis was created with workflowr (version 1.6.2). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.



The command set.seed(20201104) was run prior to running the code in the R Markdown file. Setting a seed ensures that any results that rely on randomness, e.g. subsampling or permutations, are reproducible.


The results in this page were generated with repository version 2da01ed. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .Rhistory
    Ignored:    .Rproj.user/
    Ignored:    .tresorit/
    Ignored:    _targets/
    Ignored:    data/VR_20051125.txt.xz
    Ignored:    data/VR_Snapshot_20081104.txt.xz
    Ignored:    output/blk_char.fst
    Ignored:    output/ent_blk.fst
    Ignored:    output/ent_cln.fst
    Ignored:    output/ent_raw.fst
    Ignored:    renv/library/
    Ignored:    renv/local/
    Ignored:    renv/staging/

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.


These are the previous versions of the repository in which changes were made to the R Markdown (analysis/m_01_8_check_name.Rmd) and HTML (docs/m_01_8_check_name.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File Version Author Date Message
Rmd 2da01ed Ross Gayler 2021-04-04 WIP
html 2da01ed Ross Gayler 2021-04-04 WIP

# NOTE this notebook can be run manually or automatically by {targets}
# So load the packages required by this notebook here
# rather than relying on _targets.R to load them.

# Set up the project environment, because {workflowr} knits each Rmd file 
# in a new R session, and doesn't execute the project .Rprofile

library(targets) # access data from the targets cache

library(tictoc) # capture execution time
library(here) # construct file paths relative to project root
library(fs) # file system operations
library(vroom) # fast reading of delimited text files
library(tibble) # enhanced data frames
library(stringr) # string matching
library(skimr) # compact summary of each variable
library(lubridate) # date parsing

Attaching package: 'lubridate'
The following objects are masked from 'package:base':

    date, intersect, setdiff, union
library(forcats) # manipulation of factors
library(ggplot2) # graphics
library(tidyr) # data tidying

# start the execution time clock
tictoc::tic("Computation time (excl. render)")

# Get the path to the raw entity data file
# This is a target managed by {targets}
f_entity_raw_tsv <- tar_read(c_raw_entity_data_file)

1 Introduction

The aim of this set of meta notebooks is to work out how to read the raw entity data and get it sufficiently neatened so that we can construct standardised names and modelling features without needing any further neatening. To be clear, the target (c_raw_entity_data) corresponding to the objective of this set of notebooks is the neatened raw data, before constructing any modelling features.

This notebook documents the checking of the name variables for any issues that need fixing.

These variables will be used to construct the main predictors in the compatibility models.

Regardless of whether there are any issues that need to be fixed, the analyses here may inform our use of these variables in later analyses.

Define the name variables.

  • last_name - Voter last name
  • first_name - Voter first name
  • midl_name - Voter middle name
  • name_sufx_cd - Voter name suffix
vars_name <- c(
  "last_name", "first_name", "midl_name", "name_sufx_cd" 
)

2 Read entity data

Read the raw entity data file using the previously defined functions raw_entity_data_read(), raw_entity_data_excl_status(), raw_entity_data_excl_test(), raw_entity_data_drop_novar(), raw_entity_data_parse_dates(), and raw_entity_data_drop_cancel_dt().

# Show the data file name
fs::path_file(f_entity_raw_tsv)
[1] "VR_20051125.txt.xz"
d <- raw_entity_data_read(f_entity_raw_tsv) %>% 
  raw_entity_data_excl_status() %>% 
  raw_entity_data_excl_test() %>% 
  raw_entity_data_drop_novar() %>% 
  raw_entity_data_parse_dates() %>% 
  raw_entity_data_drop_cancel_dt()
dim(d)
[1] 4099699      24

Take a quick look at the distributions.

d %>% skimr::skim()
Table 2.1: Data summary
Name Piped data
Number of rows 4099699
Number of columns 24
_______________________
Column type frequency:
character 23
Date 1
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
county_id 0 1.00 1 3 0 100 0
county_desc 0 1.00 3 12 0 100 0
voter_reg_num 0 1.00 12 12 0 1786064 0
last_name 0 1.00 1 21 0 191996 0
first_name 23 1.00 1 19 0 126589 0
midl_name 252695 0.94 1 20 0 175742 0
name_sufx_cd 3869063 0.06 1 3 0 101 0
house_num 0 1.00 1 6 0 27534 0
half_code 4088996 0.00 1 1 0 41 0
street_dir 3812561 0.07 1 2 0 8 0
street_name 7 1.00 1 30 0 83244 0
street_type_cd 154594 0.96 2 4 0 119 0
street_sufx_cd 3941004 0.04 1 3 0 11 0
unit_num 3755239 0.08 1 7 0 16116 0
res_city_desc 19 1.00 3 20 0 783 0
state_cd 18 1.00 2 2 0 5 0
zip_code 21 1.00 5 9 0 902 0
area_cd 2628117 0.36 1 3 0 507 0
phone_num 2540990 0.38 1 7 0 1072592 0
sex_code 0 1.00 1 1 0 3 0
sex 0 1.00 3 6 0 3 0
age 0 1.00 1 3 0 135 0
birth_place 718647 0.82 2 2 0 56 0

Variable type: Date

skim_variable n_missing complete_rate min max median n_unique
registr_dt 0 1 1899-09-14 9999-10-21 1997-01-09 18249
  • last_name 100% filled
  • first_name ~100% filled (23 missing)
  • midl_name 94% filled
  • name_sufx_cd 6% filled
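
The fill rates above can also be computed directly instead of being read off the skim table. A minimal sketch in base R, using a small made-up data frame (`toy` and its values are hypothetical stand-ins for `d`):

```r
# Toy stand-in for the entity data (hypothetical values, not from the real file)
toy <- data.frame(
  last_name    = c("SMITH", "JONES", "LE", "BROWN"),
  first_name   = c("ANN", NA, "DU", "JO"),
  midl_name    = c("M", NA, NA, "LEE"),
  name_sufx_cd = c(NA, "JR", NA, NA),
  stringsAsFactors = FALSE
)

vars_name <- c("last_name", "first_name", "midl_name", "name_sufx_cd")

# Proportion of non-missing values for each name variable
fill_rate <- vapply(toy[vars_name], function(v) mean(!is.na(v)), numeric(1))
round(fill_rate, 2)
```

On the real data the same `vapply()` call could be applied to `d[vars_name]`.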

3 Name length

Look at the distributions of name lengths first, before moving on to analyses more focused on standardisation.

Calculate the lengths of the name variables.

x <- d %>% 
  dplyr::mutate(
    len_last = stringr::str_length(last_name),
    len_first = stringr::str_length(first_name),
    len_midl = stringr::str_length(midl_name)
  )

3.1 last_name

last_name Voter last name

Look at the distributions of name lengths.

summary(x$len_last)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  1.000   5.000   6.000   6.345   7.000  21.000 
table(x$len_last, useNA = "ifany")

      1       2       3       4       5       6       7       8       9      10 
     18    2046   53580  393363  864542 1094952  805773  514347  212379   96777 
     11      12      13      14      15      16      17      18      19      20 
  33039   12034    6844    4239    2679    1632     824     404     152      73 
     21 
      2 
x %>% 
  ggplot() +
  geom_histogram(aes(x = len_last), binwidth = 1) +
  scale_y_sqrt()


Look at examples of short names.

# length == 1
x %>% 
  dplyr::filter(len_last == 1) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
A CHUH NA SHADY GROVE
A THEK NA SWANEE
H MOIH NA GREENBRIAR
J J NA LIVE OAK
K HOA HIEP MALLEN
K NGEO NA MALLEN
K NIUH NA BINGHAM
K RICHARD V CROSSRIDGE
K SANG NA WESTERWOOD VILLAGE
M COY FAY NOCARVA
N RENEE VIVIAN ROBINSON
R ANDREW PERNELL GRINDSTAFF
R MARY NA MOUNT PLEASANT CHURCH
S PETER THOMAS ROXBORO
U RAYMOND NA WARLEY
X MARCUS NA CRESCENT
X WILLIE LARRY BAXTER
Y PRUM NA GREENBRIAR
  • 1-letter last names are very rare
  • 1-letter last names are probably errors
# length == 2
x %>% 
  dplyr::filter(len_last == 2) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
AR ORAWAN P GLENCLIFF
DO HANH THUAN T LARKSPUR
EA YOUNG HOW CHARLOTTE
EO YONG SUK DONOVAN
HA YONG S DRESDEN
KO LINDA KYONGSUK OLD FAYETTEVILLE
LE ANDREW CHAU PINE
LE DANH MINH AMBERFIELD
LE DU D WATERVIEW
LE NANCY NICHOLS LADFORD
LE QUANG TRAN WILSHIRE
LU IAN MICHAEL PINKNEY CHURCH
MA ARNOLD M SUMMERHILL RIDGE
MA JAMES SUNG KAO BARON MONCK PASS
NG AMY L0CKAMY PARKSTONE
VO KHANH HUU HILLTOP
WU KUY M BOSSWOOD
YI JI SUK NA COUNTRY
YU JUN HYUK MILLSTONE
YU XIAO LI LONGBRIDGE
  • Most 2-letter last names are probably valid.
  • ST is probably Saint from a multi-word last name

Look at examples of long names.

# length == 21
x %>% 
  dplyr::filter(len_last == 21) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ALESSANDRETTI-STRAUSS MARIA E ALCLARE
BREWINGTON-SUTHERLAND LISA A BELLS VALLEY
  • 21-letter last names are hyphenated
# length >= 20
x %>% 
  dplyr::filter(len_last >= 20) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ANASTASIOU-JOSEPHIDE THEODORA A LONG PINE
ARDESHIRPOUR-ZARTOSH PARVIZ NA DUNKIRK
ARRIAGADA-VALENZUELA GONZALO ESTEBAN BENCHMARK
BEDINGFIELD-DEMATTEO HOLLIS BEDINGFIELD NAVAHO
BEN MESSAOUDMESSAOUD AHMED BEN FUGATE
FERRIOLA-BRUCKENSTEI ZACHARY NA PINEHURST
FRANKFORT-WINNINGHAM SUSAN R MOUNTAIN LAKE
GERKHARDT-GODZIEMSKI ALICE ELIZABETH WHITNEY
HUDSON-CHARLES-PIERR MONIQUE NA LONGLEAF
KACZMAREK-HUFFSTETLE KIM NA MUSES
KLOCZKOWSKI-BERTRAND DAWN M HOMESTEAD PARK
MCCUTCHEON-GUTKNECHT LISA ANN SHAW FARM
MORRISON-WESTMORELAN DAWN IRVING MOUNTAIN
NOOHLANHLA GUGULETHE ALAMILLA NA HODGE
SCHIAPPACASSE-DEPUTY STEPHANIE E AMBLESIDE
SOTELO DE LOS SANTOS MARCOS ANTONIO FOX
THEODORDES-GRINESTAF APRIL ARLETHA FERN COVE
THEODORIDES-GRINESTA APRIL ARLETHA FERN COVE
VALL-SPINOSA PERKINS JESSIE FAYE KINLOCKE
WASHINGTON-HALFKENNY DAVID D DAVIS MILL
  • 20+-letter last names appear to be multi-word and/or hyphenated
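
One rough way to quantify "multi-word and/or hyphenated" is to count the tokens obtained by splitting on spaces and hyphens. A minimal sketch in base R; the first three names are taken from the tables above, `SMITH` is a made-up single-token comparison, and the choice of delimiters is an assumption:

```r
# Example last names (first three from the tables above, last one hypothetical)
toy_last <- c(
  "ALESSANDRETTI-STRAUSS",
  "SOTELO DE LOS SANTOS",
  "VALL-SPINOSA PERKINS",
  "SMITH"
)

# Split on space or hyphen and count the resulting tokens per name
n_tokens <- lengths(strsplit(toy_last, "[ -]"))

# Names with more than one token are multi-word and/or hyphenated
data.frame(last_name = toy_last, n_tokens = n_tokens, multi = n_tokens > 1)
```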

3.2 first_name

first_name Voter first name

Look at the distributions of name lengths.

summary(x$len_first)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
  1.000   5.000   6.000   5.913   7.000  19.000      23 
table(x$len_first, useNA = "ifany")

      1       2       3       4       5       6       7       8       9      10 
   8070    3799   99236  525505 1077727 1018768  884199  295743  135014   19359 
     11      12      13      14      15      16      17      18      19    <NA> 
  29314    1487     880     345     215       9       4       1       1      23 
x %>% 
  ggplot() +
  geom_histogram(aes(x = len_first), binwidth = 1) +
  scale_y_sqrt()
Warning: Removed 23 rows containing non-finite values (stat_bin).


Look at the missing names.

x %>% 
  dplyr::filter(is.na(first_name)) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ALEXANDER NA JASON ARLEE
AMEN NA NA BERRYMEADOW
BULLARD NA ALEXIS THIRD
BURGESS NA NA HORACE PERRY
CHESTER NA JAMES ASH LANDING
ELSASS NA NA RIVER FALLS
FRISBY NA M CABARRUS GRAVES DORM
FRYE WILLIAM C NA NA GATES FOREST
FUQUA NA MARY NC HIGHWAY 62 S
FUQUA NA WILLIAM NC HIGHWAY 62 S
GRAYWOLF NA NA OTIS
JUDITH NA NA HILLSIDE
KAUCHICK NA PAULINE BAKERS MILL
MAGENTA NA NA WINDYRUSH
MALIK NA NA ASHBY WOODS
MCKEEL NA LESTER BIG DADDYS
MOLET NA MICHAEL PINE FOREST
MORRIS NA ALEXANDER WOODWAY OAK
PATTERSON NA JOHN DEXTER PINECREST
PHOENIX NA NA MIRACLE MOUNTAIN
SILVERMOON NA NA SUNLIGHT
WARREN NA NA J AND S
ZIMMER NA CLIFFORD GREEN SPRINGS
  • Some missing first names look like the middle name is actually the first name, e.g. ? JASON ALEXANDER
  • Some missing first names appear to have only a last name, e.g. ? ? AMEN
  • Some missing first names appear to have the entire name in the last name variable, e.g. ? ? FRYE WILLIAM C
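
The three patterns above could be flagged mechanically for review. A minimal sketch in base R, assuming that a space in `last_name` indicates a whole name stored in that variable (the function name `classify_missing_first` and the labels are hypothetical; the first three rows come from the table above, the last is made up):

```r
toy <- data.frame(
  last_name  = c("ALEXANDER", "AMEN", "FRYE WILLIAM C", "SMITH"),
  first_name = c(NA, NA, NA, "ANN"),
  midl_name  = c("JASON", NA, NA, NA),
  stringsAsFactors = FALSE
)

# Label records with a missing first name by the most likely pattern.
# Records with a first name present are left as NA.
classify_missing_first <- function(last_name, first_name, midl_name) {
  multi_word_last <- grepl(" ", last_name, fixed = TRUE)
  miss <- is.na(first_name)
  out <- rep(NA_character_, length(last_name))
  out[miss & !is.na(midl_name)] <- "middle name may be first name"
  out[miss & is.na(midl_name) & multi_word_last] <- "whole name may be in last_name"
  out[miss & is.na(midl_name) & !multi_word_last] <- "last name only"
  out
}

toy$pattern <- classify_missing_first(toy$last_name, toy$first_name, toy$midl_name)
toy
```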

Look at examples of short names.

# length == 1
x %>% 
  dplyr::filter(len_first == 1) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ANDREWS A E WALNUT CREEK
BARNETTE C V FREEMAN
BENFIELD J D BENFIELD
BOONE A C 5TH
BOSTWICK H KATHLEEN HWY 28
CAMPBELL W THOMAS IRWIN
HERMAN L E OLD WILKESBORO RD EXT
HOOKER S A MIDWAY SCHOOL
MCDONALL W B MRS CLOVERDALE
MILLER J H WATSON
MILTON E D STRATHDON
OVERMAN R DALE HERRING
REIDENBACH W SCOTT GRAY MARES
SMITH J C MORTON
STRAUB C WINIFRED BATTS
TOWNSEND J B BAMBOO
TUTTLE M GERTRUDE BAUX MOUNTAIN
WIGGINS J BELTON NC HIGHWAY 55
WILLIAMS A BLANDENA WATER PLANT
WILLIFORD W T JACKSON
  • The 1-letter first names appear to be using an initial as the first name
# length == 2
x %>% 
  dplyr::filter(len_first == 2) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
BOONE JO GRAHAM CLIFF
BROOKS SU TONYA CARYETTE BROOKSDALE
CALVILLO AJ NA CANTERBURY
CLARK JO W BEN NEVIS
FAIR JD FAIR HOSPITAL
FAULHABER JO ANN STONEYBROOKE
FOWLER LA SONDA ROCK
HARVEY TY NA SPRING
HOFF MI YONG FIELDCREST
JUDD JO D PINE
MCKEE JO SHUMATE JONQUIL
MILLER AL NA STRAWBERRY
MIMS JO CHANDLER BURNSIDE
MULLEN JO L BATTS
NGUYEN HO NGOC JARRETT FARM
NICHOLSON DE MELVIN RINGWOOD
TANG YU NA FOUR ACRE
THOMASON JO CARPENTER 31ST AV
WHICHARD AL NA O'FARRELL
WON UN T QUARTZ HILL

2-letter first names appear to be:

  • Valid, e.g. JO W CLARK, HO NGOC NGUYEN
  • Part of a multi-word name that has been split across the first and middle name variables, e.g. LA SONDA FOWLER
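
Split names cannot be told apart from valid short names automatically, but candidates can be listed for manual review. A minimal sketch in base R, assuming that a 2-letter first name followed by a middle name longer than one letter is worth a look (this rule is an assumption, and it will also pick up valid names such as HO NGOC; the rows are taken from the table above):

```r
toy <- data.frame(
  last_name  = c("FOWLER", "CLARK", "NGUYEN", "HARVEY"),
  first_name = c("LA", "JO", "HO", "TY"),
  midl_name  = c("SONDA", "W", "NGOC", NA),
  stringsAsFactors = FALSE
)

# which() drops rows where the condition is NA (e.g. a missing name)
idx <- which(
  nchar(toy$first_name) == 2 &
    !is.na(toy$midl_name) &
    nchar(toy$midl_name) > 1
)

candidates <- toy[idx, ]
# Show what the name would look like if the two parts were joined
candidates$joined <- paste0(candidates$first_name, candidates$midl_name)
candidates
```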

Look at the long names.

# length >= 16
x %>% 
  dplyr::filter(len_first >= 16) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ANDERSON MICHAEL-CHEROKEE DEMCK FARMRIDGE
DOUPE KIMBERLY DANIELLE WYATT WEBB COVE
ENRIQUEZ MARIA DEL CARMEN NA LODGE SOUTH
FIELDS ADRIENNE`FELICIA NA WESTNEDGE
LAPPAS-KOTARA MICHELLE-ADRIENNE NA COUNTRY CLUB
MIDDLESWORTH ELIZABETH-LINDSAY MCCOY PECAN
NAGARAJ SANTHEBACHAHALLI S ROYAL TROON
NATARAJA HEGGADADEVANAKOTE NA MIMOSA PARK
NGUYEN THI PHUONG KHAUH NA FARMGATE
NUNEZ MARIANA DE JESUS N REDDMAN
ODEMS MICHAEL-CHRISTOPHER NA CLAIBORNE WOODS
PERRY SHIRLEY ANN-PEPPER NA ALPHA
RODRIGUEZ MARIA DEL CARMAN NA SUMMER HOUSE
SUBRAMANIAM LAKSHMINARAYANAN NA PARKS FARM
WINKLER ELIZABETH PORTIS G CARRIAGE LAMP

Long first names appear to be:

  • Long non-Anglo names, e.g. LAKSHMINARAYANAN
  • Multi-word and/or hyphenated, e.g. ELIZABETH-LINDSAY

3.3 midl_name

midl_name Voter middle name

These names will often be missing or initials only.

Look at the distributions of name lengths.

summary(x$len_midl)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
   1.00    3.00    5.00    4.73    6.00   20.00  252695 
table(x$len_midl, useNA = "ifany")

     1      2      3      4      5      6      7      8      9     10     11 
826716  10491 289439 440549 651587 705383 508158 227267 114306  30604  20536 
    12     13     14     15     16     17     18     19     20   <NA> 
  9807   5186   3514   3379     50     21      8      2      1 252695 
x %>% 
  ggplot() +
  geom_histogram(aes(x = len_midl), binwidth = 1) +
  scale_y_sqrt()
Warning: Removed 252695 rows containing non-finite values (stat_bin).

  • Many records are missing middle name
  • Spike of 1-letter names will be initials

Look at the long names.

# length >= 16
x %>% 
  dplyr::filter(len_midl >= 16) %>% 
  dplyr::select(ends_with("_name")) %>% 
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, first_name) %>% 
  knitr::kable()
last_name first_name midl_name street_name
ARTIST SYLVIA JOYCE WIILIAMSON SHATTALON
BILLOW LESLEY ELIZABETH CLAUSS BRIDGE POINTE
BOWDEN CORA FRANCES THOMPSON CENTRAL
BRINKHUIS VANESSA INGRID-PRISCILLA WINDING WOODS
CALL LUNIA ANNTONIA MCCRARY RURITAN
DELLA MEA CAROLYN ROBINSON COUNTRY VIEW
EXUM SHEILA LANENA WHITEHEAD PEELE
GANAWAY SUSAN ANN WINTERHALTER GATESHEAD
GULLEY JOHN MARCUS DELAFAYETTE POLO
HARRIS ANN PULLER- MARCELINE-ZO BALDY MURPHY
HARTSFIELD NAOMI RUTH SATTERWHITE BUDDYS
HICKS NELLIE BEATRICE-RICHARDSON PERSIMMON
HOGGARD ANN DENISE HARRINGTON JESSE HOGGARD
MOORE VIDA GWENEVERE BARNER GILMER
RIVERA RAFAEL ANTONIO CARAMBOT GRAVEYARD
ROGERS RUBYE REBECCA/SUDDRETH HOLLY
SWINSON MARY ELIZABETH FRANCIS HOLLOMAN
WHITENER STEPHANIE LYNNE WARREN PARKER FOX HILL
WOOD T BENBURY HAUGHTON GREENFIELD
YOUNGER ZEE CAMILLE PREVETTE STATESVILLE
  • Long middle names appear to be multiple names and/or hyphenated
# clean up
rm(x)
gc()
            used  (Mb) gc trigger   (Mb)  max used   (Mb)
Ncells   4764654 254.5    7213358  385.3   6237960  333.2
Vcells 114486777 873.5  251180768 1916.4 209250640 1596.5

4 name_sufx_cd

name_sufx_cd Voter name suffix

This is intended for generation markers, e.g. Junior, Senior.

I am not going to use name suffix in entity resolution because age should be sufficient and is much better quality. I will look at what values turn up in the name suffix because the same values sometimes wrongly occur in the main name variables. Knowing what values occur may help us to remove those values from the main name variables.
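
One way to check whether suffix values have leaked into the main name variables is to look for a suffix as the final space-separated token of a name. A minimal sketch in base R, assuming a small illustrative set of generation markers (`gen_sufx` is an assumption, not a list defined in this project) and made-up last names:

```r
# Illustrative subset of generation markers (assumed, not exhaustive)
gen_sufx <- c("JR", "SR", "II", "III", "IV")

# Pattern: a space, then one of the markers, at the end of the string
pattern <- paste0(" (", paste(gen_sufx, collapse = "|"), ")$")

# Hypothetical last names
toy_last <- c("SMITH JR", "JONES", "SRINIVASAN", "BROWN III", "MCIVER")

has_trailing_sufx <- grepl(pattern, toy_last)
data.frame(last_name = toy_last, has_trailing_sufx)
```

Requiring the leading space keeps names such as SRINIVASAN or MCIVER, which merely contain the letters of a marker, from being flagged.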

d %>% dplyr::select(name_sufx_cd) %>% skimr::skim()
Table 4.1: Data summary
Name Piped data
Number of rows 4099699
Number of columns 1
_______________________
Column type frequency:
character 1
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
name_sufx_cd 3869063 0.06 1 3 0 101 0
table(d$name_sufx_cd, useNA = "ifany") %>% sort() %>% rev()

   <NA>      JR     III      SR      II      IV     JR.     SR.       I       V 
3869063  153804   29605   27494   14043    3682    1060     226     218     190 
    111     MRS      11      VI       `     VII     MR.     MS.       J       E 
     67      50      28      27      13       9       7       5       5       4 
     MR       C       W     SCO       S     REV       R       N       M      JD 
      3       3       2       2       2       2       2       2       2       2 
    DR.       D     ANN       0     (JR       X     WAL     VIR     TOB     Sr. 
      2       2       2       2       2       1       1       1       1       1 
    SMI     SAM     REE     RAY       Q     PLA       P      ON      OD       O 
      1       1       1       1       1       1       1       1       1       1 
     MS     MOO     MMO      MD     MCQ     MAC     LOC     LLL      LL     LEW 
      1       1       1       1       1       1       1       1       1       1 
    LEE     LAR       L     KIT     KEN       K     JR,     JAC     ING     ILI 
      1       1       1       1       1       1       1       1       1       1 
    II.       H     GUY     GLE       G     FOR     FAU     F M      EY     EWA 
      1       1       1       1       1       1       1       1       1       1 
    ELS     DOR      DO     DIC     CUB     CHA       B     ALB     AJR       A 
      1       1       1       1       1       1       1       1       1       1 
    8TH       5     3RD      39     346       2      1V      15     134     070 
      1       1       1       1       1       1       1       1       1       1 
     \\     (II 
      1       1 
# get a better look at the cleaned suffixes
d %>% 
  dplyr::mutate(
    sufx = name_sufx_cd %>% 
      stringr::str_to_upper() %>% 
      stringr::str_remove_all(pattern = "[^A-Z0-9]") %>% # remove non-alphanumeric
      dplyr::na_if("") 
  ) %>% 
  dplyr::count(sufx) %>% 
  dplyr::filter(n > 1) %>% 
  dplyr::arrange(desc(n), sufx) %>% 
  knitr::kable()
sufx n
NA 3869077
JR 154867
III 29605
SR 27721
II 14045
IV 3682
I 218
V 190
111 67
MRS 50
11 28
VI 27
MR 10
VII 9
MS 6
J 5
E 4
C 3
0 2
ANN 2
D 2
DR 2
JD 2
M 2
N 2
R 2
REV 2
S 2
SCO 2
W 2
  • There are generation suffixes: JR, SR, I, II (11), III (111), IV, V, VI, VII
  • There are honorific titles: MRS, MR, MS, DR, REV

5 Standardisation

Look at issues that might be addressed by standardisation.

For each type of standardisation issue, look at first, middle, and last names separately, because the issue may manifest differently in each of the name variables.

5.1 Lower-case letters.

d %>% dplyr::select(last_name) %>%
  dplyr::filter(stringr::str_detect(last_name, "[a-z]"))
# A tibble: 3 x 1
  last_name       
  <chr>           
1 MacQUEEN        
2 MacQUEEN        
3 BROWN-McCULLOUGH
d %>% dplyr::select(first_name) %>%
  dplyr::filter(stringr::str_detect(first_name, "[a-z]"))
# A tibble: 11 x 1
   first_name
   <chr>     
 1 JoANN     
 2 LaVERNE   
 3 JoANNE    
 4 JoANN     
 5 SiROBERT  
 6 McCKINES  
 7 DeNEAL    
 8 McHILDIA  
 9 JoANN     
10 LaSONYA   
11 JeROME    
d %>% dplyr::select(midl_name) %>%
  dplyr::filter(stringr::str_detect(midl_name, "[a-z]"))
# A tibble: 76 x 1
   midl_name  
   <chr>      
 1 McBRIDE    
 2 McBRIDE    
 3 McKINNIE   
 4 McLAWHORN  
 5 McKEITHAN  
 6 McCULLEN   
 7 MacFRANKLIN
 8 McQUEEN    
 9 McPHAIL    
10 McCULLEN   
# … with 66 more rows
  • Lower case letters occur in last, first, and middle names
  • They are associated with name particles (e.g. Mc, Mac, Jo, La) that could optionally be followed by a space, e.g. JoANN, McBRIDE
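
If the mixed case carries no information worth keeping, the simplest standardisation is to map everything to upper case. A minimal sketch in base R (the first three values are taken from the output above, `SMITH` is a made-up comparison; whether to upper-case at all is an assumption, not a decision made in this notebook):

```r
toy <- c("MacQUEEN", "JoANN", "McBRIDE", "SMITH")

# Detect lower-case ASCII letters; perl = TRUE avoids locale-dependent ranges
has_lower <- grepl("[a-z]", toy, perl = TRUE)

# Standardise to upper case
std <- toupper(toy)

data.frame(raw = toy, has_lower, std)
```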

5.2 Non-alphanumeric

Check for non-alphanumeric characters in names.

5.2.1 Hyphen

Check for hyphens.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "-"))

nrow(x)
[1] 20543
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
32 DURHAM 000000661780 BAZAN-MANSON ANDREA NA NA 809 NA NA BERKELEY ST NA NA DURHAM NC 27705 NA NA F FEMALE 37 MA 1995-06-23
32 DURHAM 000030026101 BENITEZ-GRAHAM ANA NA NA 21 NA NA SUNRISE PL NA NA DURHAM NC 27705 NA NA F FEMALE 34 TX 2001-07-20
98 WILSON 000057487343 BERRY-DANIEL VAUGHN NA NA 98 NA NA WILSHIRE BLVD NA NA WILSON NC 27893 252 2433141 F FEMALE 44 NC 2004-11-02
8 BERTIE 000000001037 BIBB-FREEMAN TIFFANY OCTAVIA NA 1408 NA NA EARLY STATION RD NA NA AULANDER NC 27805 252 3454679 F FEMALE 23 VA 2000-03-24
60 MECKLENBURG 000001221875 COALE-KRUPA MARY KITTY NA 617 NA N MAIN ST NA NA DAVIDSON NC 28036 NA NA F FEMALE 47 MD 2002-08-20
90 UNION 000000145759 EVERETT-GIGLIO SUZANNE MARIE NA 908 NA NA COACHMAN DR NA NA WAXHAW NC 28173 NA NA F FEMALE 42 NA 1998-07-06
60 MECKLENBURG 000001141414 HARRISON-LAMPTEY JAMES CHARLES NA 10443 NA NA KATIE CREEK CT NA 310 CHARLOTTE NC 28213 NA NA M MALE 56 NA 2000-09-06
41 GUILFORD 000000412591 JEAN-PIERRE HERBERT NA NA 5409 NA NA HIDDENBROOK DR NA NA MCLEANSVILLE NC 27301 NA NA M MALE 64 NA 2000-09-19
60 MECKLENBURG 000000935651 JENKINS-JAMES TREVA NA NA 928 NA NA GARRINGER PL NA NA CHARLOTTE NC 28208 704 7130173 F FEMALE 37 NC 1995-03-06
60 MECKLENBURG 000099847059 JOHNSON-DILLENBECK LINDA JEAN NA 4704 NA NA WILSHAM CT NA NA CHARLOTTE NC 28226 NA NA F FEMALE 58 OH 2004-11-09
26 CUMBERLAND 000066043921 JOHNSON-FORBES SHAQUILLA NICOLE JOHNSON NA 123 A S 6TH ST NA NA SPRING LAKE NC 28390 919 4786807 F FEMALE 18 NY 2005-02-28
60 MECKLENBURG 000000979771 MACK-PURNELL JOYCE A NA 4113 NA NA WILSON LN NA NA CHARLOTTE NC 28206 NA NA F FEMALE 36 SC 1996-04-10
92 WAKE 000031196782 MANNING-SHAUB CHERYL NA NA 1608 NA NA CARSON ST NA D RALEIGH NC 27608 NA NA F FEMALE 50 NY 2003-04-11
92 WAKE 000031297021 POLLARD-GREIF RHIANNON ELIZABETH NA 4117 NA NA LANDFALL CT NA NA RALEIGH NC 27613 NA NA F FEMALE 21 TX 2004-11-02
60 MECKLENBURG 000000701437 RADFORD-BLACK ANITA NA NA 2913 NA NA BRAEWOOD PL NA NA CHARLOTTE NC 28226 NA NA F FEMALE 50 NA 1984-07-01
17 CASWELL 000000005474 RICHMOND-GRAVES VANESSA D NA 10122 NA NA NC HIGHWAY 62 N NA NA NA MILTON NC 27305 NA NA F FEMALE 49 VA 1974-03-25
26 CUMBERLAND 000000194608 SHERRELL-PATTERSON KRYSTAL ANN NA 9105 NA NA DURANT NIXON RD NA NA LINDEN NC 28356 910 9801908 F FEMALE 27 NC 2000-03-16
74 PITT 000030005579 STONE-TANENBERG KAREN ANNE NA 1866 NA NA BLUE BANKS FARM RD NA NA GREENVILLE NC 27834 NA NA F FEMALE 60 PA 2000-06-05
86 SURRY 000030000468 TILLEY-VARELA MYRA AMANDA NA 204 NA NA WINDSOR PARK DR NA NA DOBSON NC 27017 336 3868771 F FEMALE 34 AR 1999-12-09
19 CHATHAM 000000046875 WIMBISH-VANDERBECK LAURA NA NA 1510 NA NA CHICKEN BRIDGE RD NA NA PITTSBORO NC 27312 919 5452160 F FEMALE 54 NA 1998-03-18
  • ~21k last names with hyphens
  • Look like legitimately hyphenated last names
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "-"))

nrow(x)
[1] 3011
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
92 WAKE 000031052249 ROSAVAGE ANN-MARIE NA NA 4600 NA NA TWISTED OAKS DR NA 1505 RALEIGH NC 27612 NA NA F FEMALE 59 PA 1999-02-12
32 DURHAM 000030047988 CRENSHAW CALLIE-ANNE DOANE NA 1434 NA NA HOLLY HILL DR NA NA DURHAM NC 27713 NA NA F FEMALE 24 AZ 2003-09-23
92 WAKE 000031080971 GLENN CHIH-TZU L NA 5409 NA NA WALTON HILL RD NA NA KNIGHTDALE NC 27545 NA NA F FEMALE 49 OC 2000-02-22
45 HENDERSON 000000058272 WOODARD ESTHER-JOAN SURRETT NA 90 NA NA EVELYNS WAY NA NA FLAT ROCK NC 28731 828 6988599 F FEMALE 61 NC 1994-04-11
90 UNION 000000178600 ROUSSEAUX JEAN-CLAUDE CHRISTIAN NA 8109 NA NA SHANNON WOODS LN NA NA MATTHEWS NC 28104 NA NA M MALE 68 OC 2003-06-03
92 WAKE 000010382004 BERARD JEAN-PAUL NA NA 1239 NA NA PATRICK CIR NA C CARY NC 27511 NA NA M MALE 35 CT 1997-10-13
96 WAYNE 000000054092 ARTIS JO-ANN NA NA 201 NA NA MEADOWCREST DR NA NA GOLDSBORO NC 27534 919 7515253 F FEMALE 47 NC 1993-01-07
11 BUNCOMBE 000000223119 FURLONG JO-ANN ALICE NA 104 NA NA GALLOWAY DR NA NA ASHEVILLE NC 28803 828 6549363 F FEMALE 52 DC 2000-03-31
41 GUILFORD 000000626310 DALE JON-MARC RYAN NA 2215 NA NA GLENSIDE DR NA NA GREENSBORO NC 27405 NA NA M MALE 22 NC 2004-10-07
41 GUILFORD 000000630891 COOK KAWIKA-JAMAL SAMUEL NA 830 NA W MARKET ST NA NA GREENSBORO NC 27401 NA NA M MALE 20 HI 2004-10-08
49 IREDELL 000009757984 MILLER LEE-JAMIL K NA 178 NA NA SEVEN OAKS LN NA NA STATESVILLE NC 28677 NA NA M MALE 19 NC 2004-08-11
12 BURKE 000000082451 BEAVER RUTH-ANNE GUST NA 4301 NA NA BECK RD NA NA MORGANTON NC 28655 828 4398144 F FEMALE 42 FL 2004-02-05
92 WAKE 000031119510 YEUNG SHIN-YIING NA NA 205 NA NA CHAMBERLAIN ST NA NA RALEIGH NC 27607 NA NA F FEMALE 27 OC 2000-10-13
92 WAKE 000031125506 JAN SHYI-TAI NA NA 127 NA NA MALDON DR NA NA CARY NC 27513 NA NA M MALE 46 OC 2000-10-31
92 WAKE 000031076587 CHU TE-HSIN A NA 1625 NA NA POINT ROYAL DR NA NA FUQUAY VARINA NC 27526 NA NA F FEMALE 29 OC 1999-11-08
76 RANDOLPH 000000113057 SU TSUNG-HU NA NA 218 NA NA ALISON LN NA NA ARCHDALE NC 27263 NA NA F FEMALE 43 OC 2000-06-20
92 WAKE 000031275564 MOJICA WILLIAM-JOSEPH KAILI NA 0 NA NA NCSU CARROLL NA NA NA RALEIGH NC 27607 NA NA M MALE 20 TX 2004-09-30
92 WAKE 000031246605 TSAI WON-WHEI NA NA 8505 NA NA DELAVAN PL NA NA RALEIGH NC 27613 NA NA F FEMALE 43 OC 2004-07-22
68 ORANGE 000000230724 TAPP YOUNG-SUK O NA 1615 NA NA MT CARMEL CHURCH RD NA NA CHAPEL HILL NC 27517 NA NA F FEMALE 37 OC 2005-04-01
92 WAKE 000010068959 CHANG YU-JHI(JULIE) CHEN NA 403 NA NA BATHGATE LN NA NA CARY NC 27513 NA NA F FEMALE 54 OC 1988-01-01
  • ~3k first names with hyphens
  • Look like legitimately hyphenated first names
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "-"))

nrow(x)
[1] 3883
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
16 CARTERET 000000072750 CABRIELE DEBRA ANN-MARIE NA 294 NA NA MARSHALLBERG RD NA NA SMYRNA NC 28579 NA NA F FEMALE 19 PA 2004-10-04
90 UNION 000000146024 CONRAD HEATHER CLEONA-JANE NA 2919 NA NA FAIRCROFT WAY NA NA MONROE NC 28110 NA NA F FEMALE 28 NA 1998-07-20
67 ONSLOW 000000340532 REEVES ANDREW DAVID-JOEL NA 110 NA NA PARNELL RD NA NA HUBERT NC 28539 910 4514985 M MALE 26 FL 2003-08-08
32 DURHAM 000030082304 BROOKS DARRICK E-ALSTON NA 108 NA N GUTHRIE AVE NA C DURHAM NC 27703 NA NA M MALE 26 NC 2004-11-02
64 NASH 000000075076 EBRAHIM EMAD ELDIN-YASHAR NA 2417 NA NA HURT DR NA NA ROCKY MOUNT NC 27804 252 4433745 M MALE 46 OC 2003-12-09
11 BUNCOMBE 000000173561 WILSON KAY FRANCES-LAWS NA 275 NA NA PISGAH VIEW RD NA NA ASHEVILLE NC 28806 NA NA F FEMALE 56 NC 1995-02-23
95 WATAUGA 000000056892 LANE MICHELLE GAYE-PRESTON NA 1877 NA NA LEE GUALTNEY RD NA NA BANNER ELK NC 28604 828 8986303 F FEMALE 45 NA 1999-10-08
92 WAKE 000031150141 YUAN DEREK HAW-LUEN NA 709 NA NA EVANVALE CT NA NA CARY NC 27511 NA NA M MALE 22 IL 2001-10-19
100 YANCEY 000000006601 PATE BARBARA JEAN-DALE NA 139 NA NA PROFFITT BRANCH RD NA NA NA BURNSVILLE NC 28714 NA NA F FEMALE 35 NC 1988-02-03
32 DURHAM 000030069361 CHAN GODWIN KWOK-YIN NA 5110 NA NA OLD CHAPEL HILL RD NA 1304 DURHAM NC 27707 NA NA M MALE 22 OC 2004-09-17
100 YANCEY 000000018225 HUGHES RACHEL LYNN-INGRAM NA 192 NA NA POLLARD MINE RD NA NA NA BURNSVILLE NC 28714 828 6750517 F FEMALE 29 NC 2004-09-30
26 CUMBERLAND 000000133376 MORTON QUIANA MAISHA-ANN NA 7500 NA NA BEVERLY DR NA NA FAYETTEVILLE NC 28314 910 8601544 F FEMALE 28 AZ 1996-08-15
60 MECKLENBURG 000001339612 SAYE ROBYN MOO-YOUNG NA 3731 NA NA ASHTON DR NA NA CHARLOTTE NC 28210 NA NA F FEMALE 28 NY 2004-10-08
65 NEW HANOVER 000000231998 DIXON STANLEY RAY-HAMILTON NA 802 NA NA RED LIGHTHOUSE LN NA NA WILMINGTON NC 28412 910 5403709 M MALE 29 NC 2004-12-30
58 MARTIN 000000027338 PARKER BRANDON SHON-DAY NA 1137 NA NA SOUTH STEVENSON RD NA NA WILLIAMSTON NC 27892 252 8091222 M MALE 20 NC 2003-10-29
51 JOHNSTON 000033060504 CYRAN JACLYN SUZANNE-MARIE NA 112 NA NA WHITLEY DR NA NA SMITHFIELD NC 27577 919 9897647 F FEMALE 18 NY 2005-05-19
92 WAKE 000031140534 BOWMAN HELEN TAUSSIG-HAUPT NA 209 NA NA COVINGTON SQUARE DR NA NA CARY NC 27513 NA NA F FEMALE 38 VA 2001-06-28
19 CHATHAM 000000101935 BELL ROSE TIEH-CHIN NA 30000 NA NA VILLAGE PARK DR NA NA CHAPEL HILL NC 27517 919 9330577 F FEMALE 61 OC 2004-10-01
81 RUTHERFORD 000030002029 COLDREN RUTH VIOLA-SHEATS NA 156 NA NA KENT DR NA NA FOREST CITY NC 28043 828 2457097 F FEMALE 83 PA 2000-08-10
68 ORANGE 000000090785 KIM LEGIA YOUNG-SON NA 211 NA NA HENDERSON ST NA NA CHAPEL HILL NC 27514 NA NA F FEMALE 37 VA 1988-10-06
  • ~4k middle names with hyphens
  • Look like legitimately hyphenated middle names

5.2.2 Quote

Check for single quotes (apostrophes).

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "'"))

nrow(x)
[1] 4920
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
11 BUNCOMBE 000000287934 D’ARBEAU STEPHEN B NA 609 B NA WALTON ST NA NA ASHEVILLE NC 28801 828 2802431 M MALE 57 NY 2005-10-05
60 MECKLENBURG 000001320093 I’ANSON-JACKSON JENNIFER NA NA 1820 NA NA INTERFACE LN NA 101 CHARLOTTE NC 28262 704 2818718 F FEMALE 37 NA 2004-10-01
32 DURHAM 000000351140 O’BRIANT MABEL S NA 2110 NA NA SUMMIT ST NA NA DURHAM NC 27707 NA NA F FEMALE 87 NC 1968-04-01
19 CHATHAM 000000098930 O’BRIEN MARK S NA 537 NA NA LASHLEY RD NA NA CHAPEL HILL NC 27516 919 4692551 M MALE 30 RI 2004-06-22
23 CLEVELAND 000001002784 O’BRIEN PATRICK WAYNE SR 101 NA NA STOCKTON ST NA NA LATTIMORE NC 28089 NA NA M MALE 23 GA 2002-09-23
27 CURRITUCK 000000013831 O’BRIEN WILLIAM PATRICK NA 143 NA NA NORTH POINT BLVD NA NA MOYOCK NC 27958 NA NA M MALE 69 WV 1997-02-28
31 DUPLIN 000000040238 O’CONNELL TINA DEE NA 312 NA NA MAGNOLIA ST NA NA ROSE HILL NC 28458 910 2894766 F FEMALE 39 NC 1996-12-13
60 MECKLENBURG 000000624796 O’MALLEY-HELMS COLLEEN E NA 9227 NA NA KINGS CANYON DR NA NA CHARLOTTE NC 28210 NA NA F FEMALE 45 NA 1991-05-01
92 WAKE 000031253314 O’MEARA MORGAN STUART NA 4409 NA NA LIVERPOOL LN NA NA RALEIGH NC 27604 NA NA M MALE 25 TN 2004-08-19
74 PITT 000030027632 O’NEAL DORIS TAYLOR NA 100 NA NA HICKORY ST NA NA GREENVILLE NC 27858 NA NA F FEMALE 92 NC 2003-11-17
74 PITT 000030039863 O’NEAL KELLY NA NA 212 NA NA FLEMING HALL NA NA NA GREENVILLE NC 27858 NA NA F FEMALE 19 NC 2004-10-08
98 WILSON 000000046621 O’NEAL BETTY MAGALENE D NA 4937 NA NA YANK RD NA NA WILSON NC 27893 252 2390768 F FEMALE 45 NC 1992-12-11
60 MECKLENBURG 000001082517 O’NEAL TAKINA L NA 11130 NA NA LEADENHALL LN NA NA CHARLOTTE NC 28262 NA NA F FEMALE 25 SC 1998-08-25
92 WAKE 000009904041 O’NEAL LINDA F NA 405 NA S ARENDELL AVE NA NA ZEBULON NC 27597 NA NA F FEMALE 49 NC 1984-01-01
86 SURRY 000000038498 O’NEAL CHARLES FRANKLIN NA 309 NA NA PINEY GROVE RD NA NA MOUNT AIRY NC 27030 336 3864645 M MALE 47 NC 1992-04-06
71 PENDER 000000030006 O’NEIL DONNA LOUISE NA 786 NA NA NC HWY 210 NA W NA HAMPSTEAD NC 28443 NA NA F FEMALE 25 NC 2000-04-05
32 DURHAM 000000625036 O’NEILL LUCILLE W NA 2206 NA W KNOX ST NA B DURHAM NC 27705 NA NA F FEMALE 78 NC 1992-02-28
32 DURHAM 000030076610 O’QUINN VICKIE LEE NA 2717 NA NA SPARGER RD NA NA DURHAM NC 27705 NA NA F FEMALE 43 WV 2004-10-07
60 MECKLENBURG 000000515108 O’ROURKE JOHN F NA 5911 NA NA BEAVER DAM LN NA NA CHARLOTTE NC 28227 NA NA M MALE 61 NA 1984-10-01
92 WAKE 000031269650 O’ROURKE JEFFERY JAMES NA 104 NA NA BROOKTREE CT NA NA MORRISVILLE NC 27560 NA NA M MALE 43 OH 2004-09-27
  • ~5k last names with quotes
  • Look like legitimately quoted last names
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "'"))

nrow(x)
[1] 1226
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
35 FRANKLIN 000000025291 KNIGHT A’NDREA LANIER NA 455 NA NA PERRY RD NA NA ZEBULON NC 27597 NA NA F FEMALE 36 NC 1996-08-02
92 WAKE 000031276525 RICHARDSON ANDRE’ STEVEN NA 0 NA NA NCSU BRAGAW NA NA NA RALEIGH NC 27607 NA NA M MALE 35 NC 2004-10-01
17 CASWELL 000000000140 CORBETT D’ANDREA PERE NA 493 NA NA ROSS ACRES RD NA 2 BURLINGTON NC 27217 NA NA F FEMALE 48 NC 1982-05-22
36 GASTON 000007555976 PASOUR D’ETTA TAYLOR NA 108 NA NA HILLSIDE DR NA NA DALLAS NC 28034 NA NA F FEMALE 34 NC 1994-10-17
76 RANDOLPH 000000130751 JONES DEONTAE’ QUINN NA 3372 NA NA US HWY 220 BUS NA S NA ASHEBORO NC 27205 NA NA M MALE 19 OC 2004-05-04
38 GRAHAM 000000006507 EDWARDS DESIRE’ DENISE NA 0 NA NA SWEETWATER RD NA NA ROBBINSVILLE NC 28771 828 NA F FEMALE 28 NC 1996-08-08
98 WILSON 000057489210 WILKINS FAR’D HAKEEM NA 8633 NA NA RANSOM LN NA NA KENLY NC 27542 919 2846763 M MALE 18 NJ 2005-08-23
91 VANCE 000000026510 RICHARDSON J’MAINE NMN NA 410 NA W ANDREWS AVE NA NA HENDERSON NC 27536 252 4929347 M MALE 27 NC 1996-01-04
42 HALIFAX 000000067780 ALSTON J’MIA KAE NA 1800 NA NA REESES RD NA NA HALIFAX NC 27839 252 5364983 F FEMALE 18 NC 2005-04-18
60 MECKLENBURG 000000569293 DUNLAP JA’TINA R NA 6731 NA NA MONTEZUMA TRL NA NA CHARLOTTE NC 28227 NA NA F FEMALE 37 NA 1988-10-01
32 DURHAM 000000683083 SUITT L’TONYA NA NA 5 NA NA BENTWOOD PL NA NA DURHAM NC 27703 NA NA F FEMALE 27 NC 1996-09-27
39 GRANVILLE 000000041403 LITTLEJOHN LA’KANYA MICHELLE NA 7087 NA NA BAYBERRY DR NA NA OXFORD NC 27565 NA NA F FEMALE 26 NC 2004-02-10
60 MECKLENBURG 000001277012 HALL LA’KETTA CHENTAL NA 608 NA N CALDWELL ST NA 307 CHARLOTTE NC 28202 704 5639085 F FEMALE 24 NA 2004-06-07
80 ROWAN 000000108801 DOWELL LA’TONYA YVETTE NA 806 NA NA PINE ST NA NA SALISBURY NC 28144 NA NA F FEMALE 25 TN 2004-09-07
32 DURHAM 000030079184 FORD O’DENA NA NA 4603 NA NA HOPE VALLEY RD NA E DURHAM NC 27707 NA NA F FEMALE 32 SC 2004-10-08
24 COLUMBUS 000000005988 JACKSON O’NEIL NA NA 303 NA W SMITH ST NA NA CHADBOURN NC 28431 NA NA M MALE 67 NC 1969-08-16
78 ROBESON 000000451402 HILBURN O’NEILL NA NA 590 NA S ELIZABETH ST NA NA ST PAULS NC 28384 NA NA M MALE 56 NC 2002-09-10
49 IREDELL 000005211000 LEBEAU RENE’ DOMITIEN NA 141 NA NA BONNIE LN NA NA STATESVILLE NC 28625 NA NA M MALE 64 NA 1979-12-17
42 HALIFAX 000000067645 WEATHERBEE SADE’ SHANNON NA 1099 NA NA NC HWY 561 NA NA NA SCOTLAND NECK NC 27874 NA NA F FEMALE 19 NC 2005-02-28
60 MECKLENBURG 000001024462 MITCHELL SHANTAE’ T NA 4915 NA NA HOBBS HILL DR NA NA CHARLOTTE NC 28212 NA NA F FEMALE 37 NC 1996-10-01
  • ~1k first names with quotes
  • Look like legitimately quoted first names
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "'"))

nrow(x)
[1] 3152
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
39 GRANVILLE 000000042946 GRILLO LUIS CHE’ NA 404 NA NA LINDEN AVE NA NA OXFORD NC 27565 NA NA M MALE 32 DC 2004-09-07
13 CABARRUS 000030031942 CRAWFORD NICKOLAS D’ANDRE NA 707 NA NA COOPERS RIDGE DR NA NA KANNAPOLIS NC 28083 NA NA M MALE 20 NC 2004-10-01
1 ALAMANCE 000009066220 LARSEN HEATHER D’ANN NA 2705 B NA BERMUDA TRL NA NA MEBANE NC 27302 336 5785009 F FEMALE 25 NC 1999-09-20
60 MECKLENBURG 000000732603 SINGHATEH NICHELLE DY’VONNE NA 5200 NA NA PINE DR NA NA CHARLOTTE NC 28269 704 6055265 F FEMALE 37 NC 1990-08-01
23 CLEVELAND 000000064612 HARPER DUNSEY LA’TAZE NA 765 NA NA BROAD ST NA NA SHELBY NC 28152 NA NA M MALE 35 NC 1994-10-31
60 MECKLENBURG 000001306534 BATTLE IKEDA LE’RECIA NA 5937 NA NA PROVIDENCE RD NA 7 CHARLOTTE NC 28226 NA NA F FEMALE 25 NA 2004-09-17
92 WAKE 000031155934 O’CONNELL KAREN O’BRIEN NA 12329 NA NA CHASTAIN DR NA NA RALEIGH NC 27614 NA NA F FEMALE 43 NJ 2002-01-29
74 PITT 000000122240 SPERRY ANN O’BRIEN NA 539 NA NA CRESTLINE BLVD NA NA GREENVILLE NC 27834 NA NA F FEMALE 45 VA 1996-10-10
36 GASTON 000007613636 GINYARD DEEDRICK O’BRIEN NA 728 NA NA STAGECOACH RD NA NA GASTONIA NC 28052 NA NA M MALE 19 NC 2004-03-05
92 WAKE 000010126163 ARNEY KATHLEEN O’DWYER NA 11901 NA NA OLD CREEDMOOR RD NA NA RALEIGH NC 27613 NA NA F FEMALE 45 IL 1992-01-01
76 RANDOLPH 000000082672 DOWNES ANN O’HARA NA 4801 NA NA WHISPER OAK DR NA NA TRINITY NC 27370 NA NA F FEMALE 52 MA 1992-09-30
68 ORANGE 000000227469 VANHOOK BRANDON O’NEAL NA 118 NA NA ALDERMAN DORM NA NA NA CHAPEL HILL NC 27514 NA NA M MALE 21 NC 2004-10-04
29 DAVIDSON 000000115972 JONES ROBIN O’NEIL NA 200 NA NA OLDE FOX TRL NA NA LEXINGTON NC 27292 NA NA F FEMALE 34 NA 1998-12-15
25 CRAVEN 000000066985 JOHNSON DAESHAWAN O’NEIL NA 955 NA NA ADAMS CREEK RD NA NA HAVELOCK NC 28532 252 4470876 M MALE 25 NC 1998-05-29
60 MECKLENBURG 000001147528 MANNS RUSSELL O’NEIL JR 1802 NA NA BRAY DR NA NA CHARLOTTE NC 28214 704 3989747 M MALE 41 VA 2000-10-02
11 BUNCOMBE 000000263326 CALLOWAY TOMIKA RENEE’ NA 305 NA NA WEAVERVILLE RD NA 1 ASHEVILLE NC 28804 828 6454768 F FEMALE 22 NC 2004-06-02
32 DURHAM 000000621881 THOMAS DENA RENEE’ NA 5704 NA NA HADRIAN DR NA NA DURHAM NC 27703 NA NA F FEMALE 33 NC 1991-09-27
25 CRAVEN 000000073455 CALLAHAN PAMELA RENEE’ NA 103 NA NA LEWIS FARM RD NA 3 NEW BERN NC 28560 NA 0000000 F FEMALE 25 NA 2000-03-21
10 BRUNSWICK 000001447563 GRAHAM QUINDERIA SH’RON NA 4439 NA NA SMITH TRL NA NA LELAND NC 28451 910 6554499 F FEMALE 21 NC 2002-05-22
60 MECKLENBURG 000001305918 JAMES BRITTANY VONT’E NA 1925 NA NA TERRYBROOK LN NA 2 CHARLOTTE NC 28205 704 6057935 F FEMALE 22 NA 2004-09-17
  • ~3k middle names with quotes
  • Look like legitimately quoted middle names
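
The three checks above differ only in the column being tested, so they could be collapsed into a single pass. The following is a minimal sketch in base R, assuming a small toy data frame in place of `d`; `count_matches()` is a hypothetical helper and is not part of the original analysis.

```r
# Toy stand-in for the voter data; the real `d` has many more columns.
toy <- data.frame(
  last_name  = c("O'NEAL", "SMITH", "D'ARBEAU"),
  first_name = c("KELLY", "LA'TONYA", "STEPHEN"),
  midl_name  = c(NA, "RENEE'", "B"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: number of values in each column that match `pattern`.
# grepl() returns FALSE for missing values, so NAs count as non-matches.
count_matches <- function(df, cols, pattern) {
  vapply(
    df[cols],
    function(x) sum(grepl(pattern, x)),
    integer(1)
  )
}

quote_counts <- count_matches(
  toy,
  c("last_name", "first_name", "midl_name"),
  "'"
)
print(quote_counts)
```

Treating missing values as non-matches gives the same counts as the `dplyr::filter()` calls above, because `filter()` drops rows where the condition is `NA`.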

5.2.3 Period

Check for periods.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "\\."))

nrow(x)
[1] 11
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
5 ASHE 000000014095 BINGHAM JR. AMES EDMOND NA 0 NA NA ASHE PARK RD NA NA WEST JEFFERSON NC 28694 NA NA M MALE 32 CO 1994-07-26
68 ORANGE 000000121958 DAYE JR. JAMES NA JR 501 NA NA LANCASTER RD NA NA MEBANE NC 27302 NA NA M MALE 31 NC 1992-11-04
59 MCDOWELL 000000037352 RUSSELL, JR. KERMITT PATRICK NA 1008 NA NA GOOSE CREEK RD NA NA MARION NC 28752 828 7383705 M MALE 36 NC 1995-11-13
82 SAMPSON 000000489470 ST. CLAIR JACK LEE NA 4480 NA NA GIDDENSVILLE RD NA NA FAISON NC 28341 910 5940169 M MALE 54 NA 1997-09-09
26 CUMBERLAND 000000128083 ST. CYR CANDICE NICOLE NA 6681 NA NA WINCHESTER ST NA NA FAYETTEVILLE NC 28314 910 8686454 F FEMALE 31 OK 1996-05-07
60 MECKLENBURG 000001321254 ST. GEORGE MARTHA S NA 733 NA NA PLANTATION ESTATES DR NA 306 MATTHEWS NC 28105 704 8499492 F FEMALE 86 NC 2004-10-04
60 MECKLENBURG 000001325127 ST. GERMAIN AMY NA NA 2506 NA NA WHITEHALL ESTATES DR NA G CHARLOTTE NC 28273 NA NA F FEMALE 23 NA 2004-10-06
60 MECKLENBURG 000001308643 ST. JOHN JESSICA JO NA 1035 NA NA YORKMONT RIDGE LN NA 818 CHARLOTTE NC 28217 704 3298055 F FEMALE 29 NA 2004-09-21
66 NORTHAMPTON 000000014466 ST. LAWRENCE ELIZABETH W NA 200 NA NA HAMPTON WOODS RD NA NA JACKSON NC 27845 NA NA F FEMALE 76 CT 2000-10-13
60 MECKLENBURG 000001307743 ST.CLAIRE KEVIN WAYNE NA 7835 NA NA EULER WAY NA NA CHARLOTTE NC 28214 704 3980037 M MALE 32 LA 2004-09-17
60 MECKLENBURG 000001300705 ST.JOHN JOANN DIMAGGIO NA 12708 NA NA WILLINGDON RD NA NA HUNTERSVILLE NC 28078 704 9483586 F FEMALE 47 NA 2004-09-15
  • 11 last names with periods
  • Look like legitimate abbreviations (mostly ST.; three have a JR. suffix in the last name)
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "\\."))

nrow(x)
[1] 120
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
51 JOHNSTON 000003056783 NORRIS A.T. NA NA 5876 NA NA STRICKLANDS CROSSROADS NA NA NA FOUR OAKS NC 27524 NA NA M MALE 72 NC 2000-10-09
39 GRANVILLE 000000005309 EVINS BETTY L. CURRIN NA 4066 NA NA OUR RD NA NA OXFORD NC 27565 NA NA F FEMALE 52 NC 1972-04-01
62 MONTGOMERY 000000011173 BUIE BEVERLY D. COOKE NA 549 NA NA SHOE FACTORY RD NA NA MT GILEAD NC 27306 NA NA F FEMALE 51 NC 1972-03-18
60 MECKLENBURG 000001334959 DUNN C. SHAY HARRELSON NA 4001 NA NA BELLINGHAM LN NA NA CHARLOTTE NC 28215 704 5967613 F FEMALE 37 NC 2004-10-08
60 MECKLENBURG 000001292537 SWOFFORD D. MILYNN NA 13702 NA NA ROBERT WALKER DR NA NA DAVIDSON NC 28036 610 7724025 F FEMALE 48 NA 2004-08-16
91 VANCE 000000005688 ROSS E. TRAVIS JR 325 NA NA WOODLAND RD NA NA HENDERSON NC 27536 NA NA M MALE 69 NC 1968-10-19
39 GRANVILLE 000000014863 INGOLFSSON E. JUANITA O’BRIEN NA 207 NA NA GILLIS ST NA NA STOVALL NC 27582 NA NA F FEMALE 85 NC 1978-03-11
89 TYRRELL 000000001129 LILLEY G. C. NA 402 NA NA LIGHT ST NA NA COLUMBIA NC 27925 NA NA M MALE 80 NA 1965-10-16
68 ORANGE 000000154826 NAVARRE J. RICHARD II 27 NA NA BLUFF TRL NA NA CHAPEL HILL NC 27516 NA NA M MALE 38 MI 1996-10-08
59 MCDOWELL 000000011205 JARRETT J. REID NA 1259 NA NA ASHWORTH RD NA NA MARION NC 28752 NA NA M MALE 83 NC 1900-01-01
89 TYRRELL 000000000701 AINSLEY J. (JULIUS) T.(THOMAS) NA 10155 NA NA NEWLANDS RD NA NA COLUMBIA NC 27925 NA NA M MALE 65 NA 1971-01-30
36 GASTON 000002721700 RENDLEMAN J.T. NA NA 309 NA N LEE ST NA NA MOUNT HOLLY NC 28120 NA NA M MALE 78 NC 1968-10-07
62 MONTGOMERY 000000003700 GIBSON M. COLINE NA 731 NA NA LANDFILL RD NA NA MT GILEAD NC 27306 NA NA F FEMALE 72 NC 1976-10-04
39 GRANVILLE 000000005468 HICKS MARY E. PALMER NA 109 NA NA JONES ST NA NA OXFORD NC 27565 NA NA F FEMALE 84 NC 1972-02-28
62 MONTGOMERY 000000009295 UNDERWOOD NORMA J. PHILLIPS NA 1334 NA NA ELDORADO ST NA NA TROY NC 27371 NA NA F FEMALE 43 NC 1980-03-27
62 MONTGOMERY 000000019715 GARSKA P.J. JAN DE BEWR NA 38 NA NA CLUBHOUSE DR NA NA NEW LONDON NC 28127 336 4615376 F FEMALE 75 NA 1998-09-28
62 MONTGOMERY 000000005915 USSERY PRISCILLA B. SANDERS NA 651 NA NA HYDRO RD NA NA MT GILEAD NC 27306 NA NA F FEMALE 58 NC 1970-04-11
89 TYRRELL 000000002506 BOUCHER T. RENEE NA NA 210 NA N VIRGINIA AVE NA NA COLUMBIA NC 27925 NA NA F FEMALE 25 NA 1998-04-08
36 GASTON 000007522626 GABLE THOMAS J. WESTLEY NA 2780 NA NA XRAY DR NA NA GASTONIA NC 28054 NA NA M MALE 76 NC 1998-08-28
68 ORANGE 000000149269 MUSSON W. JAMES NA 2309 NA NA WYNGATE DR NA NA HILLSBOROUGH NC 27278 NA NA M MALE 51 NA 1996-08-19
  • 120 first names with periods
  • Look like initials
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "\\."))

nrow(x)
[1] 2233
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
22 CLAY 000000002813 KYKER JACOB C. NA 114 NA NA CHERRYWOOD DR NA NA HAYESVILLE NC 28904 NA NA M MALE 38 NC 1994-10-13
11 BUNCOMBE 000000222731 TOWNSEND IRIS D. MELENDEZ NA 153 NA NA BURTON ST NA NA ASHEVILLE NC 28806 NA NA F FEMALE 39 NA 2000-03-27
42 HALIFAX 000000007007 FLOYD WALTER E. NA 331 NA NA WILLIAMS ST NA NA ROANOKE RAPIDS NC 27870 NA NA M MALE 68 NA 1968-10-12
66 NORTHAMPTON 000000010637 WARMACK WILLIAM G. NA 326 NA NA STANFORD ST NA NA RICH SQUARE NC 27869 NA NA M MALE 54 NA 1972-01-04
91 VANCE 000000005000 MASON PEGGY H. NA 1734 NA NA PARKER LN NA NA HENDERSON NC 27536 252 4384482 F FEMALE 68 NC 1965-10-16
36 GASTON 000007532682 ENOCH WILLIAM H. NA 118 NA NA WITHROW DR NA NA DALLAS NC 28034 NA NA M MALE 101 NC 1990-09-15
91 VANCE 000000014135 MITCHELL JOHN J. NA 117 NA NA VILLAGE DR NA NA HENDERSON NC 27536 252 4383279 M MALE 51 NC 1983-05-29
91 VANCE 000000004259 HAMILTON MILON J. NA 2221 NA NA WOODLAND RD NA NA HENDERSON NC 27536 252 4924481 M MALE 82 NC 1972-07-31
91 VANCE 000000005338 PARROTT ULYSSES J. JR 1204 NA NA ROANOKE AVE NA NA HENDERSON NC 27536 NA NA M MALE 78 NC 1964-05-02
26 CUMBERLAND 000000113313 NEWBLE ANDRE L.K. NA 304 NA NA CARTWRIGHT DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 40 NA 1995-10-02
47 HOKE 000000013457 LENAHAN LOIS M. NA 341 NA NA HARRIS LN NA NA RAEFORD NC 28376 NA NA F FEMALE 72 WI 1995-12-22
42 HALIFAX 000000057121 JONES TANETTA M. NA 1407 NA NA GRACE ST NA NA SCOTLAND NECK NC 27874 NA NA F FEMALE 24 NA 2000-04-05
22 CLAY 000000004587 PHILLIPS HELEN M. NA 8140 NA NA HIGHWAY 64 NA E NA HAYESVILLE NC 28904 NA NA F FEMALE 77 NC 1970-10-10
22 CLAY 000000003883 MOORE DOROTHY M. NA 154 NA NA FORT HEMBREE ST NA NA HAYESVILLE NC 28904 NA NA F FEMALE 76 NC 1970-04-18
42 HALIFAX 000000020200 EDWARDS STANLEY M. NA 908 NA NA COUNTRY CLUB RD NA NA WELDON NC 27890 NA NA M MALE 67 NA 1970-04-14
22 CLAY 000000000621 BROOKS MARILYN M.LEDFORD NA 69 NA NA MIDDLE VALLEY DR NA NA HAYESVILLE NC 28904 NA NA F FEMALE 49 NC 1994-03-21
91 VANCE 000000007573 WAVERLY TRACY R. NA 407 NA NA JANE AVE NA NA HENDERSON NC 27536 252 4925047 F FEMALE 36 NC 1988-09-17
22 CLAY 000000005032 SCOTT HENRY R. NA 475 NA NA PINE LOG CHURCH RD NA NA BRASSTOWN NC 28902 NA NA M MALE 57 KS 1984-10-05
47 HOKE 000000016798 GRAHAM KENDRA T. NA 740 NA NA ROCKFISH RD NA NA RAEFORD NC 28376 NA NA F FEMALE 30 NC 1998-01-16
47 HOKE 000000005710 ESPERGREN MARY T. NA 0 NA NA ROLLING HLS NA NA RAEFORD NC 28376 NA NA F FEMALE 74 SC 1980-08-15
  • ~2k middle names with periods
  • Look like initials
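
The pattern `"\\."` used above is escaped because an unescaped period is a regular-expression wildcard. A short base R sketch on made-up values shows the difference; the vector `x` is illustrative only.

```r
# Illustrative values only; not taken from the data.
x <- c("ST. JOHN", "STJOHN", "J.T.", NA)

# An unescaped "." matches any single character, so every
# non-missing, non-empty value matches.
any_char <- grepl(".", x)

# The escaped "\\." matches only a literal period.
has_period <- grepl("\\.", x)

# fixed = TRUE treats the pattern as a plain string, so no escape is needed.
has_period_fixed <- grepl(".", x, fixed = TRUE)

print(data.frame(x, any_char, has_period, has_period_fixed))
```

With stringr, wrapping the pattern in `stringr::fixed(".")` gives the same literal match as `fixed = TRUE` does here.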

5.2.4 Comma

Check for commas.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, ","))

nrow(x)
[1] 2
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
59 MCDOWELL 000000037355 FILLINGHAM, II ROBERT E NA 0 NA NA HILLVIEW DR NA NA MARION NC 28752 828 6527120 M MALE 53 OH 1995-11-21
59 MCDOWELL 000000037352 RUSSELL, JR. KERMITT PATRICK NA 1008 NA NA GOOSE CREEK RD NA NA MARION NC 28752 828 7383705 M MALE 36 NC 1995-11-13
  • 2 last names with commas
  • Suffix values (II, JR.) appended to the last name after a comma instead of being recorded in name_sufx_cd
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, ","))

nrow(x)
[1] 4
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
59 MCDOWELL 000000023296 PHILLIPS FRANK, NA JR 252 NA NA CHEROKEE WAY NA NA MARION NC 28752 NA NA M MALE 50 NC 1900-01-01
59 MCDOWELL 000000038045 HICKS MARION, NA SR 195 NA NA CLINTON LN NA NA MARION NC 28752 828 7384792 M MALE 58 NC 1996-04-09
59 MCDOWELL 000000045948 CANIPE NOAH, NA JR 1245 NA NA ASHWORTH RD NA NA MARION NC 28752 828 6597728 M MALE 70 NC 2000-11-27
59 MCDOWELL 000000031434 MCADAMS WILL,JR NA NA 145 NA NA HIGHLAND DR NA NA MARION NC 28752 828 6529266 M MALE 56 NC 1991-10-12
  • 4 first names with commas
  • Arbitrary added punctuation
  • Punctuation for suffix field value added to first name
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, ","))

nrow(x)
[1] 12
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
66 NORTHAMPTON 000000013298 FAUCETTE JESSE EDWARD, J NA 0 NA NA NORTH SHORE SUBDIVISION NA NA NA HENRICO NC 27842 252 5377332 M MALE 66 NA 2000-07-03
66 NORTHAMPTON 000000014351 BRASWELL ROBERT ELLIS, J NA 0 NA NA HICKORY HOLLOW RD NA NA GASTON NC 27832 NA NA M MALE 47 NA 1990-02-05
66 NORTHAMPTON 000000006286 MARTIN LLOYD FRANKLIN, S NA 111 NA E JEFFERSON ST NA NA JACKSON NC 27845 NA NA M MALE 49 NA 1974-04-05
66 NORTHAMPTON 000000011718 GAY ROBERT HENRY, III. NA 312 NA S MAIN ST NA NA SEABOARD NC 27876 252 5891914 M MALE 32 NA 1998-11-25
66 NORTHAMPTON 000000004165 FERGUSON STANTON HYDE, J NA 307 NA NA CRAIGE ST NA NA GASTON NC 27832 NA NA M MALE 57 NC 1972-03-25
66 NORTHAMPTON 000000014995 CLARK COLEMAN JACKSON, I NA 0 NA NA SAFFOLD ST NA NA CONWAY NC 27820 NA NA M MALE 36 NA 1990-03-30
66 NORTHAMPTON 000000015301 BARNES RUSSELL JOSEPH, J NA 104 NA NA MULBERRY ST NA NA WOODLAND NC 27897 NA NA M MALE 71 NA 1990-04-08
62 MONTGOMERY 000000011042 PIERCE RUTH P, NA 230 NA NA JUBAL REEVES CIR NA NA MT GILEAD NC 27306 NA NA F FEMALE 75 NC 1976-10-02
77 RICHMOND 000000004857 COVINGTON EDNA(MRS PERRY, JR) NA 804 NA NA ASPEN LN NA NA ROCKINGHAM NC 28379 NA NA F FEMALE 0 CO 1970-04-18
62 MONTGOMERY 000000005799 SCARBOROUGH JOHN R, NA 2730 NA NA NC HWY 109 NA S NA MT GILEAD NC 27306 NA NA M MALE 82 NC 1970-04-04
66 NORTHAMPTON 000000015604 SHEARIN ANDREW THOMAS, S NA 204 NA NA WILSON ST NA NA GARYSBURG NC 27831 252 5363497 M MALE 49 NA 2001-01-21
66 NORTHAMPTON 000000013607 WILLIAMS ERVIN W., SR., NA 0 NA NA ALSTON ST NA NA GARYSBURG NC 27831 252 5362884 M MALE 38 NA 2000-09-20
  • 12 middle names with commas
  • List separator
  • Punctuation to squeeze in extra field
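
Several of the values above carry a suffix after a comma. One possible way to separate the two parts is sketched below in base R. This is an assumption about how such values could be handled, not the standardisation the analysis actually applies, and `split_suffix()` is a hypothetical helper.

```r
# Hypothetical helper: split "NAME, SUFFIX" at the first comma.
split_suffix <- function(x) {
  has_comma <- grepl(",", x, fixed = TRUE)
  # Text before the first comma, with surrounding whitespace trimmed.
  name <- trimws(sub(",.*$", "", x))
  # Text after the first comma, stripped of periods, commas and spaces.
  rest <- gsub("[., ]", "", sub("^[^,]*,", "", x))
  suffix <- ifelse(has_comma, rest, NA_character_)
  # A bare trailing comma leaves an empty suffix; treat it as missing.
  suffix[!is.na(suffix) & suffix == ""] <- NA_character_
  data.frame(name = name, suffix = suffix, stringsAsFactors = FALSE)
}

res <- split_suffix(c("RUSSELL, JR.", "FILLINGHAM, II", "FRANK,", "WILL,JR", "SMITH"))
print(res)
```

The sketch does not handle parenthetical notes such as `EDNA(MRS PERRY, JR)`, which would need separate treatment.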

5.2.5 Other non-alphanumeric

Check for other non-alphanumeric characters.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "[^ a-zA-Z0-9\\.,'-]"))

nrow(x)
[1] 31
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) # %>% 
# A tibble: 20 x 24
   county_id county_desc voter_reg_num last_name           first_name midl_name
   <chr>     <chr>       <chr>         <chr>               <chr>      <chr>    
 1 26        CUMBERLAND  000000152206  "BOYLES`"           LINDA      BROWN    
 2 26        CUMBERLAND  000000041392  "BRYANT`"           WILLIAM    STEWART  
 3 12        BURKE       000000041577  "COLLINS/SISK"      RHONDA     L        
 4 60        MECKLENBURG 000001308276  "D*AMICO"           PATRICIA   MARIE    
 5 60        MECKLENBURG 000001327305  "D*AMICO"           MEGAN      MARIE    
 6 14        CALDWELL    000001461244  "GALINSKY/MALAGUTI" DANA       ANNE     
 7 60        MECKLENBURG 000001331730  "GOSHEN\\"          DIXIE      M        
 8 60        MECKLENBURG 000001322929  "LA\"BEE"           DELACRUZ   <NA>     
 9 12        BURKE       000000043552  "MARTIN/HUFF"       ELLEN      MARIE    
10 26        CUMBERLAND  000000133396  "MORRISON`"         HAZEL      M        
11 20        CHEROKEE    000000026270  "NICHOLS/BROWN"     MARY       SUE      
12 60        MECKLENBURG 000001334267  "O*BRIEN"           COLIN      JAMES    
13 60        MECKLENBURG 000001302073  "O*TOOLE"           PETER      TERRENCE 
14 12        BURKE       000000039331  "REAVIS/LONG"       SHAWN      MICHELLE 
15 12        BURKE       000000001293  "RHONEY/PETERS"     DONNA      <NA>     
16 41        GUILFORD    000000621540  "SCHERM%MARTIN"     WYATT      <NA>     
17 60        MECKLENBURG 000001186215  "STRTHEIT\\"        LOLA       C        
18 12        BURKE       000000041072  "TALBERT/GRAHAM"    BRENDA     <NA>     
19 12        BURKE       000000064351  "TUCKER/JACKSON"    LAVONDA    LYNN     
20 89        TYRRELL     000000002842  "WOODARD`"          JASON      WARREN   
# … with 18 more variables: name_sufx_cd <chr>, house_num <chr>,
#   half_code <chr>, street_dir <chr>, street_name <chr>, street_type_cd <chr>,
#   street_sufx_cd <chr>, unit_num <chr>, res_city_desc <chr>, state_cd <chr>,
#   zip_code <chr>, area_cd <chr>, phone_num <chr>, sex_code <chr>, sex <chr>,
#   age <chr>, birth_place <chr>, registr_dt <date>
  # knitr::kable() # some of the characters break the kable formatting
  • 31 last names with other non-alphanumeric characters
  • Most look like substitutions for hyphen or quote
  • Some look like random cruft
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "[^ a-zA-Z0-9\\.,'-]"))

nrow(x)
[1] 102
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(first_name, sex) # %>% 
# A tibble: 20 x 24
   county_id county_desc voter_reg_num last_name first_name         midl_name
   <chr>     <chr>       <chr>         <chr>     <chr>              <chr>    
 1 2         ALEXANDER   000000009950  POTEAT    "(KAY)"            ANNE CATH
 2 60        MECKLENBURG 000001304452  FIELDS    "ADRIENNE`FELICIA" <NA>     
 3 60        MECKLENBURG 000001270308  STEELE    "AR`KISHA"         FERNESE  
 4 60        MECKLENBURG 000001314968  JACKSON   "AR`MONIE"         <NA>     
 5 60        MECKLENBURG 000001321294  STUBBS    "BRITNE`"          ELIZABETH
 6 60        MECKLENBURG 000001309884  CLARK     "CANDERE`"         L        
 7 41        GUILFORD    000000452373  SELF      "CATHERINE`"       MARIE    
 8 60        MECKLENBURG 000000905586  FABIAN    "D`ARLINE"         D        
 9 60        MECKLENBURG 000001255029  INGRAM    "D`WON"            LAMONTE  
10 89        TYRRELL     000000001220  NICHOLS   "DORIS ( MRS W"    <NA>     
11 60        MECKLENBURG 000001272460  STEWART   "JA`VONDA"         NICHOLE  
12 62        MONTGOMERY  000000004933  SPENCER   "JAMES (JIM)"      N        
13 60        MECKLENBURG 000001297984  CARTER    "JOSE`"            PIERRE   
14 60        MECKLENBURG 000001273960  HEMPHILL  "LA`CHERICA"       EVON     
15 60        MECKLENBURG 000001330895  CHESTNUT  "LA`WANDA"         F        
16 39        GRANVILLE   000000021399  DUNN      "MARY (\"PETE\")"  BURNETTE 
17 32        DURHAM      000000678990  JERNOVICS "MARY SUSAN/"      R        
18 62        MONTGOMERY  000000016846  KERN      "O  (BUDDY)"       R        
19 1         ALAMANCE    000009070384  FOSTER    "OTIS(NMN)"        JR       
20 60        MECKLENBURG 000001289378  DAVIDOV   "ZVIYA`CRYSTAL"    <NA>     
# … with 18 more variables: name_sufx_cd <chr>, house_num <chr>,
#   half_code <chr>, street_dir <chr>, street_name <chr>, street_type_cd <chr>,
#   street_sufx_cd <chr>, unit_num <chr>, res_city_desc <chr>, state_cd <chr>,
#   zip_code <chr>, area_cd <chr>, phone_num <chr>, sex_code <chr>, sex <chr>,
#   age <chr>, birth_place <chr>, registr_dt <date>
  # knitr::kable() # some of the characters break the kable formatting
  • 102 first names with other non-alphanumeric characters
  • Some look like substitutions for hyphen or quote
  • Some are parenthetical notes
  • Some look like random cruft
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "[^ a-zA-Z0-9\\.,'-]"))

nrow(x)
[1] 1097
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
51 JOHNSTON 000001440800 KEENE JOSEPHINE (MRS OTIS) NA 600 NA N MAIN ST NA NA FOUR OAKS NC 27524 919 9632510 F FEMALE 78 NA 1964-05-16
1 ALAMANCE 000009059830 PIERMARINI JACLYN (NMN) NA 104 NA NA TROON PL NA NA ELON NC 27244 NA NA F FEMALE 25 NC 1998-02-23
1 ALAMANCE 000009051296 TORAIN ROOSEVELT (NMN) JR 3516 NA NA HAWKEYE LN NA NA HAW RIVER NC 27258 336 5783700 M MALE 36 NY 1996-07-14
1 ALAMANCE 000009024495 WILEY ROOSEVELT (NMN) JR 214 NA N ST JOHN ST NA NA BURLINGTON NC 27217 336 2277408 M MALE 51 NC 1990-10-02
1 ALAMANCE 000009052072 HARVEY NATHANIEL (NMN) NA 505 NA NA OAK ST NA NA GRAHAM NC 27253 336 2296743 M MALE 78 NC 1996-08-30
1 ALAMANCE 000001958400 EULISS MAX (NMN) NA 8207 NA NA CHARLIE EULISS RD NA NA SNOW CAMP NC 27349 NA 0000000 M MALE 86 NC 1940-05-11
50 JACKSON 000000016730 ASHE WILLARD (NMN) NA 232 NA NA GREENBRIER DR NA NA CULLOWHEE NC 28723 NA NA M MALE 83 NC 1986-04-04
1 ALAMANCE 000009069486 CAMERON CHARLES (NMN) JR 800 NA E HANOVER RD NA NA GRAHAM NC 27253 336 2221187 M MALE 64 NC 2000-07-05
1 ALAMANCE 000009010598 CRUTCHFIELD WAYNE (NMN) NA 2436 NA NA MORNINGSIDE DR NA NA BURLINGTON NC 27217 336 2297723 M MALE 46 NC 1986-10-04
1 ALAMANCE 000009016792 ROBERTS ROBIN (NMN) NA 1008 NA NA AVON AVE NA E BURLINGTON NC 27215 336 2262263 M MALE 50 NC 1988-10-01
1 ALAMANCE 000009033657 COHEN SETH (NMN) NA 122 NA NA AVENUE OF TREES NA NA NA ELON NC 27244 NA 4495922 M MALE 48 GA 1992-09-28
28 DARE 000000065347 DEMAS DOLORIS A/GEARHART NA 280 NA NA HILLCREST DR NA NA KITTY HAWK NC 27949 NA NA M MALE 61 WV 1996-10-11
78 ROBESON 000000130512 MUSSELWHITE ALLISON ELAINE/HUMPH NA 3815 NA NA ABBY LN NA NA LUMBERTON NC 28360 NA NA F FEMALE 43 NA 1988-10-10
96 WAYNE 000000001917 BASS H J (HUBERT) NA 113 NA NA LAKEVIEW DR NA NA GOLDSBORO NC 27530 919 7354827 M MALE 74 NA 1966-04-28
78 ROBESON 000000411771 BRITT ANGELA KAY / ROGERS NA 208 NA NA HIGHLAND AVE NA NA LUMBERTON NC 28358 NA NA F FEMALE 40 NA 1990-04-05
78 ROBESON 000000437992 LOCKLEAR MINNIE LEE/JONES NA 243 NA NA MCKINNON ROLLIN RD NA NA LUMBERTON NC 28358 NA NA F FEMALE 72 NA 1998-02-23
11 BUNCOMBE 000000201486 DREW VIRGINIA M/KELLEY NA 2 NA NA GLEN CABLE RD NA NA ASHEVILLE NC 28805 NA NA F FEMALE 85 NA 1997-10-10
90 UNION 000000011500 BROOKS WILLIAM MACK (BILL) NA 4013 NA NA CONCORD HWY NA NA MONROE NC 28110 NA NA M MALE 80 NA 1963-12-23
78 ROBESON 000000418303 FRIEDRICH DELLA MAE /KEYS NA 8516 NA NA NC HWY 72 NA E NA LUMBERTON NC 28358 NA NA F FEMALE 54 NA 1994-03-05
14 CALDWELL 000001461948 ALSTON MEI WAN/DAN NA 215 NA NA NEWLAND PL NA NA LENOIR NC 28645 NA NA F FEMALE 39 DC 1996-10-08
  • ~1k middle names with other non-alphanumeric characters
  • Some look like substitutions for hyphen
  • Many are parenthetical notes (NMN = no middle name)
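
To see which characters account for these matches, the offending characters can be extracted and tabulated. The sketch below uses base R on a few made-up values rather than `d`. It assumes the same character class as the checks above, written without the backslash before the period, which is not needed inside a character class.

```r
# Illustrative values only; the real check runs on the name columns of `d`.
x <- c("COLLINS/SISK", "D*AMICO", "BOYLES`", "O'NEAL", "(NMN)", "ST. JOHN", NA)
x <- x[!is.na(x)]

# Anything other than space, letters, digits, period, comma, quote or hyphen.
pattern <- "[^ a-zA-Z0-9.,'-]"

# Extract every offending character, then count occurrences of each one.
hits <- unlist(regmatches(x, gregexpr(pattern, x, perl = TRUE)))
char_counts <- table(hits)
print(char_counts)
```

A table like this makes it easier to decide which characters are substitutions for a hyphen or quote and which mark parenthetical notes.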

5.3 Digits

Check for digits.

5.3.1 Zero

Check for the digit zero.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "0"))

nrow(x)
[1] 29
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
96 WAYNE 000030025595 ALEM0N NOE A NA 158 NA NA BARWICK RD NA NA MOUNT OLIVE NC 28365 919 6350689 M MALE 18 NA 2005-07-19
44 HAYWOOD 000000060583 BOLAD0 PAULA HUTCHENS NA 352 NA NA VALLEY VIEW CIR NA NA WAYNESVILLE NC 28786 828 4564529 F FEMALE 27 CT 2000-05-02
90 UNION 000000191758 CAPUT0 BARBARA DAVIS NA 4823 NA NA MOSSY CUP LN NA NA MONROE NC 28110 NA NA F FEMALE 58 VA 2004-10-04
34 FORSYTH 000030017819 CONR0Y WILLIAM COURTNEY NA 1105 NA NA WELLESLEY PLACE DR NA NA LEWISVILLE NC 27023 NA NA M MALE 31 IL 2000-09-29
34 FORSYTH 000030046636 D0WNS MARIO ENRICO NA 108 NA NA FOXCROFT DR NA NA WINSTON SALEM NC 27103 NA NA M MALE 33 OC 2003-02-26
10 BRUNSWICK 000001459098 EAT0N VICKIE TUGGLE NA 4880 NA NA GRINDERS WAY NE NA LELAND NC 28451 910 6555932 F FEMALE 56 NC 2004-09-15
34 FORSYTH 000030038740 ESC0BEDO AUDREY ANN NA 3656 NA NA CEDAR SPRINGS DR NA D WINSTON SALEM NC 27107 NA NA F FEMALE 34 TX 2002-07-16
41 GUILFORD 000009906810 FERNANDEZ-BRAV0 GIOVANNI NA NA 1706 NA E GREEN DR NA NA HIGH POINT NC 27260 NA NA M MALE 29 NA 1999-07-09
82 SAMPSON 000000495452 GUARDAD0 MANUEL FELIX NA 200 NA NA JORDAN LN NA NA CLINTON NC 28328 910 5928129 M MALE 25 TX 2000-05-01
87 SWAIN 000000003037 J0HNSON LUCILLE FRANCES NA 76 NA NA HAYWOOD DR NA NA BRYSON CITY NC 28713 828 4883465 F FEMALE 81 NC 1980-04-01
18 CATAWBA 000030003662 JOHNS0N MICHAEL NA NA 4761 NA S NC 16 HWY NA NA MAIDEN NC 28650 828 4641363 M MALE 40 FL 2000-05-23
92 WAKE 000010284966 MCD0UGAL BETTY JEAN NA 711 NA NA ELLINGTON ST NA 101 RALEIGH NC 27601 NA NA F FEMALE 42 NC 1995-04-25
65 NEW HANOVER 000000217012 OCONN0R GERALDINE LOUISE NA 2100 NA NA MEDEIRA CT NA NA WILMINGTON NC 28405 910 2566342 F FEMALE 70 NY 2004-06-08
98 WILSON 000057476775 PEREZ-NAVARR0 CAROLE SHAY NA 915 NA NA WOODLAND DR NA NA WILSON NC 27893 252 2340088 F FEMALE 38 NJ 2001-08-25
34 FORSYTH 000030013165 R0CCO CHRISTOPHER NA NA 6701 NA NA FAIRWOOD CT NA NA CLEMMONS NC 27012 NA NA M MALE 52 NY 2000-07-13
72 PERQUIMANS 000000008086 REYN0LDS ADAM DANIEL NA 510 NA NA PENNSYLVANIA AVE NA NA HERTFORD NC 27944 252 4261317 M MALE 22 NJ 2002-06-25
34 FORSYTH 000030063231 RUSS0 ANGEL MARIE NA 3750 NA NA FIELD SEDGE DR NA NA WINSTON SALEM NC 27107 NA NA F FEMALE 22 FL 2004-09-13
100 YANCEY 000000013213 SIMPS0N MARY ANN NA 0 NA NA CHAMP RAY RD NA NA BURNSVILLE NC 28714 NA NA F FEMALE 44 NC 1996-10-08
92 WAKE 000031116874 WIT0SKY MICHAEL ADAM NA 1906 NA NA KEOKUK CT NA NA APEX NC 27523 NA NA M MALE 31 PA 2000-10-13
13 CABARRUS 000030036553 YATSK0 JEANETTE MARIE NA 1822 NA NA SUMMIT RIDGE LN NA NA KANNAPOLIS NC 28083 NA NA F FEMALE 33 PA 2004-04-26
  • 29 last names contain a zero
  • All sampled cases look like a substitution of zero for the letter O
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "0"))

nrow(x)
[1] 33
x %>%   
  dplyr::slice_sample(n = 20) %>%
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
10 BRUNSWICK 000001442802 PETTY ALLIS0N JEAN NA 5881 NA NA LAKE JOEL DR SW NA OCEAN ISLE BEACH NC 28469 910 5793428 F FEMALE 24 NY 2001-01-08
26 CUMBERLAND 000000143784 BROWN C0LBY TODD NA 232 NA NA SUMMERTIME RD NA NA FAYETTEVILLE NC 28303 910 4852726 M MALE 36 GA 1996-10-22
34 FORSYTH 000030024868 COOPER C0RDELIA P NA 790 NA NA MOTOR RD NA NA WINSTON SALEM NC 27105 NA NA F FEMALE 52 NC 2001-01-03
92 WAKE 000010017498 WHITTEMORE D0LORES H NA 5912 NA NA WHITEBUD DR NA NA RALEIGH NC 27609 NA NA F FEMALE 53 NC 1985-01-01
76 RANDOLPH 000000023670 LOWE D0NNA G NA 2294 NA NA JUNIPER CT NA NA ASHEBORO NC 27203 NA NA F FEMALE 57 NC 1972-04-06
5 ASHE 000000017610 BRANDT J0HN C NA 3902 NA NA GRASSY CREEK RD NA NA GRASSY CREEK NC 28631 NA NA M MALE 51 PA 1998-09-08
80 ROWAN 000000111803 CRANFILL J0HN NA NA 235 NA NA MISTY LN NA NA KANNAPOLIS NC 28081 NA NA M MALE 32 OC 2004-10-08
76 RANDOLPH 000000132776 ADAMS J0HN WILLIAMS NA 101 NA S COBLE ST NA NA RANDLEMAN NC 27317 NA NA M MALE 66 TN 2004-08-25
15 CAMDEN 000000006842 TANNAHILL J0SEPH ERIC NA 308 NA NA MCPHERSON RD NA NA SOUTH MILLS NC 27976 NA NA M MALE 36 CA 2001-12-12
32 DURHAM 000030060941 WILLIAMS M0NIKA UDANA NA 822 NA NA SHAY DR NA NA DURHAM NC 27704 NA NA F FEMALE 30 NC 2004-08-19
32 DURHAM 000000548599 KEENAN MARY-J0 NA NA 2407 NA NA TRYON RD NA NA DURHAM NC 27705 NA NA F FEMALE 55 NY 1983-09-09
92 WAKE 000007656080 SHEPHERD OTH0 L NA 205 NA NA ADAMS ST NA NA CARY NC 27513 NA NA M MALE 82 NC 1957-01-01
31 DUPLIN 000000016898 THOMAS P0LLY BROWN NA 706 NA E BOSTIC ST NA NA BEULAVILLE NC 28518 910 2984587 F FEMALE 76 NC 1952-01-01
32 DURHAM 000000670319 RAMIREZ REYNALD0 G NA 1316 NA NA ELMSET LN NA NA DURHAM NC 27713 NA NA M MALE 61 TX 1996-02-14
41 GUILFORD 000000632771 BUIE S0NTE Y NA 368 NA NA PHILLIPS/HAWKINS HALL NA NA NA GREENSBORO NC 27412 NA NA F FEMALE 23 TX 2004-10-08
67 ONSLOW 000000330368 MITCHELL SHANN0N ARLINE NA 521 NA NA BRIARNECK RD NA NA JACKSONVILLE NC 28540 NA 3248039 F FEMALE 22 NC 2001-07-18
59 MCDOWELL 000000029689 JOHNSON T0NYA BETH NA 0 NA NA MOFFITT HILL RD NA NA OLD FORT NC 28762 828 6684865 F FEMALE 33 NC 1990-03-16
62 MONTGOMERY 000099422992 MONK T0NYA SIVLEY NA 61 NA NA CLUBHOUSE DR NA NA NEW LONDON NC 28127 NA NA F FEMALE 44 CA 2004-09-27
92 WAKE 000031198085 RUFFIN TIM0THY ONEILL NA 617 NA NA HANSKA WAY NA NA RALEIGH NC 27610 NA NA M MALE 20 NY 2003-05-16
34 FORSYTH 000030077879 KENNEDY V0NCIEAL LEE NA 3830 NA N MAIN ST NA NA HIGH POINT NC 27265 NA NA F FEMALE 83 CO 2004-10-26
  • 33 first names contain a zero
  • All sampled cases look like a substitution of zero for the letter O
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "0"))

nrow(x)
[1] 77
x %>%   
  dplyr::slice_sample(n = 20) %>%
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
74 PITT 000000132061 PONGPAIROJ AMANDA 0 NA 1106 NA W WRIGHT RD NA NA GREENVILLE NC 27858 NA NA F FEMALE 30 MI 1998-11-03
22 CLAY 000000002562 IVESTER WILLIAM 0DELL NA 123 NA NA JACKIE COVE RD NA NA HAYESVILLE NC 28904 NA NA M MALE 43 GA 1996-02-07
10 BRUNSWICK 000000017921 MOORE EVA 0MAE NA 680 NA NA MCLAMB RD NW NA CALABASH NC 28467 NA NA F FEMALE 75 NA 1968-11-05
21 CHOWAN 000000000878 LANE KATHLEEN 0VERTON NA 107 NA NA POPLAR NECK RD NA NA EDENTON NC 27932 252 4823378 F FEMALE 65 NC 1968-04-08
92 WAKE 000031173058 SODAGAR EASA 2205 NA 2205 NA NA CONSTITUTION DR NA NA RALEIGH NC 27615 NA NA M MALE 49 OC 2002-08-16
78 ROBESON 000000459601 FRENCH SHNETTA ALEXANDER080572 NA 320 NA NA KNAPDALE RD NA NA LUMBER BRIDGE NC 28357 910 8580410 F FEMALE 33 NC 2004-10-08
34 FORSYTH 000030082101 NEWSOME MARK ANTH0NY NA 6040 NA NA WOODCOVE DR NA NA WINSTON SALEM NC 27104 NA NA M MALE 40 NC 2005-05-24
91 VANCE 000000027597 BRODIE WILLIAM C1010 NA 1010 NA NA PINKSTON ST NA NA HENDERSON NC 27536 252 4386818 M MALE 55 NC 1996-04-23
34 FORSYTH 000030072746 SMITH BRODY CO0PER NA 2700 NA NA REYNOLDA RD NA 604 WINSTON SALEM NC 27106 NA NA M MALE 27 CO 2004-10-18
82 SAMPSON 000000492048 OROPEZA AMILCAR COL0N NA 590 NA NA KING RD NA NA CLINTON NC 28328 910 5332169 M MALE 32 PR 1998-08-28
50 JACKSON 000000027449 LUCK GENA DON0HOO NA 4807 NA NA BREEDLOVE RD NA NA GLENVILLE NC 28736 828 7433390 F FEMALE 49 TX 1996-06-17
92 WAKE 000010327009 STOLLBRINK KATHY J0 NA 1013 NA NA SANTA ROSA DR NA NA APEX NC 27502 NA NA F FEMALE 46 KS 1996-03-12
30 DAVIE 000000011874 NAYLOR ANGELA LY0NS NA 108 NA NA BRAMBLEWOOD LN NA NA MOCKSVILLE NC 27028 NA NA F FEMALE 44 NC 1984-10-01
26 CUMBERLAND 000000054227 MCKOY LILLY M00RE NA 1700 NA NA SHAW RD NA NA FAYETTEVILLE NC 28311 910 4843997 F FEMALE 69 SC 1990-02-07
1 ALAMANCE 000009062764 JONES RASHAWN M0NIQUE NA 706 NA NA HUFFMAN MILL RD NA D1 BURLINGTON NC 27215 336 5383906 F FEMALE 26 NA 1998-10-06
26 CUMBERLAND 000066009587 MARSHALL MONICA NICH0LE NA 1223 NA NA NORTH ST NA NA FAYETTEVILLE NC 28301 910 2577571 F FEMALE 20 MS 2003-08-19
98 WILSON 000057488759 DAVIS LEANN RUNY0N NA 3108 NA NA WELFORD PL N NA WILSON NC 27896 252 2914327 F FEMALE 56 IL 2005-06-15
76 RANDOLPH 000000127584 HOLTON RANDY SC0TT NA 238 NA N STALEY ST NA NA STALEY NC 27355 NA NA M MALE 33 PA 2003-08-08
23 CLEVELAND 000000086780 NATION LAVONIA V0SS NA 150 NA NA COUNTRYSIDE RD NA NA KINGS MOUNTAIN NC 28086 NA NA F FEMALE 67 AL 2000-07-15
34 FORSYTH 000010103417 HASH MYRTLE Y0UNG NA 404 NA NA DONNY BROOK CT NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 66 NC 1991-04-02
  • 77 middle names contain a zero
  • Some are a substitution of zero for the letter O (e.g. ANTH0NY, SC0TT)
  • Some are part of superfluous numbers (e.g. 2205, C1010, ALEXANDER080572)
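
The zero-for-O observation could be turned into a standardisation rule along the following lines. This is only a sketch, not part of the original analysis: `fix_zero()` is a hypothetical helper, and the rule (replace zeros only when the value contains a letter and no other digit) is an assumption chosen so that superfluous numbers such as 2205 or ALEXANDER080572 are left alone. Base R is used instead of stringr to keep the example dependency-free.

```r
# Hypothetical helper (not from the original analysis).
# Assumed rule: treat 0 as a mistyped O only when the value contains at least
# one letter and no digit other than zero.
fix_zero <- function(x) {
  only_zero_digits <- grepl("[A-Z]", x) & !grepl("[1-9]", x)
  ifelse(only_zero_digits, gsub("0", "O", x, fixed = TRUE), x)
}

# Values taken from the tables above
fix_zero(c("J0HNSON", "M00RE", "0", "2205", "ALEXANDER080572"))
```

A bare "0" (as in the first middle-name row above) is deliberately left unchanged, because it is ambiguous whether it is an initial O or a placeholder.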

5.3.2 One

Check for one.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "1"))

nrow(x)
[1] 1
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
36 GASTON 000002883975 SATTERFIELD 111 CHARLES MASON NA 128 NA NA SPENCER MOUNTAIN RD NA NA GASTONIA NC 28056 NA NA M MALE 66 NC 1980-03-11
  • 1 last name contains a one
  • Substitution of 1 for I in a generation suffix (111 = III) that was entered in the last name field
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "1"))

nrow(x)
[1] 0
  • 0 first names with one
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "1"))

nrow(x)
[1] 39
x %>%   
  dplyr::slice_sample(n = 20) %>%
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
13 CABARRUS 000000068633 PATTERSON CARL 11 NA 2821 NA NA TWINFIELD DR NA NA CONCORD NC 28025 704 NA M MALE 51 NA 1990-02-05
13 CABARRUS 000000090508 ADAMS RALPH 11 NA 5120 NA NA WOODSIDE ST NA NA KANNAPOLIS NC 28081 704 9385014 M MALE 44 NA 1993-11-22
13 CABARRUS 000000081139 REED CHARLES 11 NA 5240 NA NA IRISH POTATO RD NA NA KANNAPOLIS NC 28083 704 7860777 M MALE 52 NA 1992-04-08
13 CABARRUS 000000092555 WILLIAMS JOSEPH 11 NA 3317 NA NA FARM LAKE DR SW NA CONCORD NC 28027 704 7849184 M MALE 46 NA 1994-04-11
13 CABARRUS 000000076502 BEST KENNETH 111 NA 1307 NA NA THOMPSON DR NA NA CONCORD NC 28025 704 NA M MALE 40 NA 1991-06-04
13 CABARRUS 000000086589 LOPEZ CARLOS 111 NA 1023 NA NA LYERLY RIDGE RD NW NA CONCORD NC 28027 704 NA M MALE 45 NA 1992-10-05
13 CABARRUS 000000070382 QUERY FRED 111 NA 1900 NA NA CAROLINA AVE NA NA KANNAPOLIS NC 28083 704 NA M MALE 33 NA 1990-03-30
13 CABARRUS 000000009451 FEATHERSTONE GEORGE 111 NA 2203 NA NA FLORIDA AVE NA NA KANNAPOLIS NC 28083 704 NA M MALE 61 NA 1968-04-16
13 CABARRUS 000000095123 FREEZE HOMER 111 NA 9915 NA NA HOLLY TREE DR NA NA CHARLOTTE NC 28215 704 5977975 M MALE 43 NA 1994-10-05
13 CABARRUS 000000081103 KLUTTZ JOE 111 NA 8512 NA NA WILLOW OAK LN NA NA HARRISBURG NC 28075 704 NA M MALE 45 NA 1992-04-08
13 CABARRUS 000000075754 MCGOVERN WILLIAM 111 NA 10209 NA NA ROBERT BOST RD NA NA MIDLAND NC 28107 704 8885934 M MALE 34 NA 1991-03-07
13 CABARRUS 000000067035 WINECOFF DAVID 111 NA 4921 NA NA ROGERS LAKE RD NA NA KANNAPOLIS NC 28081 704 9330375 M MALE 36 NA 1988-10-10
45 HENDERSON 000000059125 JOHNSON ULUS 111 NA 401 NA NA SUBSTATION ST NA B HENDERSONVILLE NC 28792 828 6972262 M MALE 46 MA 1994-06-29
13 CABARRUS 000000080879 COOKE GEORGE 111 NA 2110 NA NA DEEPWOOD PL NW NA CONCORD NC 28027 704 7842288 M MALE 57 NA 1992-05-22
33 EDGECOMBE 000000045330 HOWERIN MICHAEL DALE401 NA 401 NA E HOPE LODGE ST NA NA TARBORO NC 27886 NA NA M MALE 31 NC 1992-07-15
82 SAMPSON 000001074216 HUNTER MORDECAI J1-TO NA 140 NA NA TRAPPERS RUN LN NA NA CLINTON NC 28328 910 5645511 M MALE 27 GA 2002-11-07
92 WAKE 000031284677 FAICLOTH TIMOTHY LOUIS7100 NA 7100 NA NA BROOMFIELD WAY NA NA RALEIGH NC 27615 NA NA M MALE 45 NC 2004-10-07
93 WARREN 000000012356 BREEN TERRANCE MICHAEL146 NA 146 NA E COVE DR NA NA MACON NC 27551 919 2573548 M MALE 59 MI 1997-12-15
77 RICHMOND 000000928134 PATTERSON CARLA NADINE DOUGLAS1 NA 120 NA NA PAM ST NA NA HAMLET NC 28345 910 5828543 F FEMALE 44 NC 2004-08-23
32 DURHAM 000000379143 PLESS JOAN WRIGHT2106 NA 6 NA NA PARK PL NA NA DURHAM NC 27712 NA NA F FEMALE 62 NC 1976-09-01
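  • 39 middle names contain a one
  • Some are a substitution of 1 for I in a generation suffix (11 = II, 111 = III); in this sample nearly all of these are from CABARRUS
  • Some are part of superfluous numbers (e.g. DALE401, MICHAEL146)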
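
A possible way to recover the generation suffix from these values is sketched below. It is an illustration only: `ones_to_roman()` is a hypothetical helper, and it assumes that a stand-alone token of two or three ones always means II or III.

```r
# Hypothetical helper (not from the original analysis).
# Assumption: a stand-alone token "11" or "111" is a mistyped II or III.
# \b keeps digits embedded in longer tokens (e.g. MICHAEL146) untouched.
ones_to_roman <- function(x) {
  x <- gsub("\\b111\\b", "III", x, perl = TRUE)
  gsub("\\b11\\b", "II", x, perl = TRUE)
}

# Values taken from the tables above
ones_to_roman(c("11", "111", "SATTERFIELD 111", "J1-TO", "DALE401", "MICHAEL146"))
```

The converted token would still need to be moved to name_sufx_cd; that step is not shown here.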

5.3.3 Other digits

Check for other digits.

x <- d %>% 
  dplyr::filter(stringr::str_detect(last_name, "[2-9]"))

nrow(x)
[1] 1
x %>%   
  dplyr::slice_sample(n = 20) %>% 
  dplyr::arrange(last_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
76 RANDOLPH 000000110024 ALBER5TSON BASIL ERVIN NA 5297 NA NA ALBERTSON FARM RD NA NA TRINITY NC 27370 NA NA M MALE 62 NC 1999-09-25
  • 1 last name contains a digit in 2-9 (the 5 in ALBER5TSON)
  • Looks like a random insertion
x <- d %>% 
  dplyr::filter(stringr::str_detect(first_name, "[2-9]"))

nrow(x)
[1] 2
x %>%   
  dplyr::slice_sample(n = 20) %>%
  dplyr::arrange(first_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
91 VANCE 000000102098 SPIVEY FR4ANK THOMAS SR 136 NA NA SALT AND LIGHT LN NA NA HENDERSON NC 27537 NA NA M MALE 44 NC 2004-08-05
92 WAKE 000031272471 CHILTON J8IMMIE HERBERT NA 400 NA W ELM AVE NA NA HOLLY SPRINGS NC 27540 NA NA M MALE 56 NC 2004-09-23
  • 2 first names with digits 2-9
  • Look like random insertions
x <- d %>% 
  dplyr::filter(stringr::str_detect(midl_name, "[2-9]"))

nrow(x)
[1] 24
x %>%   
  dplyr::slice_sample(n = 20) %>%
  dplyr::arrange(midl_name, sex) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt
92 WAKE 000031173058 SODAGAR EASA 2205 NA 2205 NA NA CONSTITUTION DR NA NA RALEIGH NC 27615 NA NA M MALE 49 OC 2002-08-16
92 WAKE 000031171772 YOUNG WANWYNE 4625 NA 4625 NA NA DRAPER RD NA NA RALEIGH NC 27616 NA NA F FEMALE 31 NY 2002-08-06
92 WAKE 000031117727 CLARKE MINERVA 4932 NA 4932 NA NA BIVENS DR NA NA RALEIGH NC 27616 NA NA F FEMALE 49 OC 2000-10-12
26 CUMBERLAND 000000088495 PHAIR IDELL 8017 NA 8017 NA NA BEAVER DAM RD NA NA AUTRYVILLE NC 28318 NA NA F FEMALE 59 NA 1900-01-01
78 ROBESON 000000459601 FRENCH SHNETTA ALEXANDER080572 NA 320 NA NA KNAPDALE RD NA NA LUMBER BRIDGE NC 28357 910 8580410 F FEMALE 33 NC 2004-10-08
10 BRUNSWICK 000001443502 BEACHAM HEATHER ANDERSON9104576 NA 3976 NA NA SOUTH POINT DR SE NA SOUTHPORT NC 28461 910 2786045 F FEMALE 32 CA 2001-03-20
14 CALDWELL 000014469305 SHUMAKER RUTH ANN BURTON47 NA 47 NA NA CRESTVIEW ST NA NA GRANITE FALLS NC 28630 828 3962768 F FEMALE 45 SC 2001-04-19
78 ROBESON 000000461781 KOERNER JENNIFER ANN155 NA 155 NA NA COLEMAN DR NA NA LUMBERTON NC 28360 910 2720144 F FEMALE 35 MD 2004-06-25
66 NORTHAMPTON 000000010514 WARD EVA B2957 NA 30015 NA NA US HWY 258 NA NA NA RICH SQUARE NC 27869 252 5392484 F FEMALE 88 NA 1980-03-19
33 EDGECOMBE 000000045330 HOWERIN MICHAEL DALE401 NA 401 NA E HOPE LODGE ST NA NA TARBORO NC 27886 NA NA M MALE 31 NC 1992-07-15
25 CRAVEN 000000097831 GLOVER DIONNE LYNN1820 NA 1820 NA NA SPENCER AVE NA NA NEW BERN NC 28560 NA NA F FEMALE 30 MD 2005-09-28
64 NASH 000000018065 GUIDO DEANA LYNN2513 NA 2513 NA NA SEXTON RD NA NA ROCKY MOUNT NC 27804 NA NA F FEMALE 32 NC 1999-01-05
28 DARE 000000053857 BECHTEL TERESA MARIE103062 NA 310 NA NA OAK RUN NA NA KITTY HAWK NC 27949 NA NA F FEMALE 43 PA 1992-10-05
93 WARREN 000000012356 BREEN TERRANCE MICHAEL146 NA 146 NA E COVE DR NA NA MACON NC 27551 919 2573548 M MALE 59 MI 1997-12-15
62 MONTGOMERY 000099423041 HILL ZEB MITCHELL368 NA 368 NA NA GADDY FARM RD NA NA MT GILEAD NC 27306 910 4399391 M MALE 68 NC 2004-09-30
21 CHOWAN 000000000137 BLAIR ESSIE MIZELLE25248249 NA 509 NA NA COKE AVE NA NA EDENTON NC 27932 252 4824919 F FEMALE 89 NC 1968-04-06
18 CATAWBA 000009078800 PERKINS TERESA ROSENBAUM3305 NA 3305 NA NA 28TH ST NE NA HICKORY NC 28601 828 2562520 F FEMALE 42 NC 1996-06-09
76 RANDOLPH 000000134472 TOOMES BRIAN SCOTT3450 NA 3450 NA NA TOM BROWN RD NA NA FRANKLINVILLE NC 27248 NA NA M MALE 19 NC 2004-09-28
79 ROCKINGHAM 000000109981 PYRTLE PHILLIP W5RAY SR 133 NA NA PARKSIDE DR NA APT12 MAYODAN NC 27027 NA NA M MALE 56 NC 2004-07-30
32 DURHAM 000000379143 PLESS JOAN WRIGHT2106 NA 6 NA NA PARK PL NA NA DURHAM NC 27712 NA NA F FEMALE 62 NC 1976-09-01
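  • 24 middle names contain digits 2-9
  • One looks like a random insertion (W5RAY)
  • Most appear to be superfluous numbers; in most sampled rows the trailing digits equal house_num, so they were probably carried over from the address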
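
The house number hypothesis can be checked, and the digits stripped, roughly as follows. This is a sketch under stated assumptions: `strip_trailing_house_num()` is a hypothetical helper, it only acts when the whole trailing run of digits equals house_num, and it sets a middle name that consisted of nothing but the house number to NA.

```r
# Hypothetical helper (not from the original analysis).
strip_trailing_house_num <- function(midl_name, house_num) {
  # Trailing run of digits in each middle name ("" when there is none)
  trailing <- sub("^.*[^0-9]", "", midl_name)

  # Only act when the *whole* trailing run equals the house number, so that
  # WRIGHT2106 with house_num 6 is not truncated to WRIGHT210
  is_house <- !is.na(trailing) & !is.na(house_num) &
    nzchar(trailing) & trailing == as.character(house_num)

  cleaned <- ifelse(
    is_house,
    substr(midl_name, 1, nchar(midl_name) - nchar(trailing)),
    midl_name
  )

  # Assumption: a middle name that was only the house number becomes missing
  cleaned[!is.na(cleaned) & cleaned == ""] <- NA_character_
  cleaned
}

# Values taken from the tables above
strip_trailing_house_num(
  midl_name = c("DALE401", "WRIGHT2106", "2205", "W5RAY", "ANN"),
  house_num = c("401", "6", "2205", "133", "155")
)
```

Values whose digits do not match house_num (e.g. ALEXANDER080572) are left for a separate rule.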

5.4 Special words

Look for special words that shouldn’t be in names.

Define word patterns to search for.

# honorifics
w_hons <- c(
  "MR", "MISTER", "MASTER", "MRS", "MS", "MISS", 
  "REV", "REVEREND", "SR", "SISTER", "BR", "BROTHER",
  "FATHER", "MOTHER", "PASTOR", "ELDER", "BISHOP",
  "DR", "DOCTOR", "MD", "PROF", "PROFESSOR"
)

# generation suffixes
w_gen <- c(
  "JR", "JNR", "JUNIOR", "SR", "SNR", "SENIOR",
  "1ST", "2ND", "3RD", "4TH", "5TH", "6TH", "7TH", "8TH",
  "FIRST", "SECOND", "THIRD", "FOURTH", "FIFTH", "SIXTH", "SEVENTH", "EIGHTH", "EIGHTTH",
  "1", "2", "3", "4", "5", "6", "7", "8",
  "I", "II", "III", "IIII", "IV", "V", "VI"
)

# special values
w_spec <- c(
  "NN", "NMN", "NAME",
  "UNK", "UNKNOWN", "AKA", "KNOWN AS", "ALSO KNOWN AS", "ALIAS",
  "BLIND"
)

# test
w_test <- c(
  "TEST", "TST", "DUMMY", "VOTER",  "([A-Z])\\1{2,}"
)
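
The last entry of w_test is a regular expression rather than a literal word: a captured letter followed by at least two repeats of itself. The sketch below shows what it matches on its own and once it is wrapped in word boundaries (as the patterns are when the search expression is built). It uses base R with perl = TRUE rather than stringr, so small engine differences are possible; the candidate strings are made up for illustration.

```r
# The repeated-letter pattern, bare and wrapped in \b word boundaries
p_bare    <- "([A-Z])\\1{2,}"
p_wrapped <- paste0("\\b", p_bare, "\\b")

# Made-up candidate strings
candidates <- c("XXX", "AAAA", "III", "AA", "BILL", "WILLLIAM")

bare_hit    <- grepl(p_bare, candidates, perl = TRUE)     # 3+ repeats anywhere
wrapped_hit <- grepl(p_wrapped, candidates, perl = TRUE)  # whole word of repeats

data.frame(candidates, bare_hit, wrapped_hit)
```

With the boundaries in place the pattern only flags whole words made of one repeated letter, which also covers the III and IIII entries of w_gen.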

5.4.1 Last name

# regular expression to match words
w_regexp <- 
  c(w_hons, w_gen, w_spec, w_test) %>% # all special words
  unique() %>% # make it a set
  dplyr::setdiff( # remove words that appear to mostly be validly used
    c(
      "BISHOP",
      "BLIND",
      "BROTHER",
      "DOCTOR",
      "ELDER",
      "FIRST",
      "JUNIOR",
      "MASTER",
      "MISS",
      "MISTER",
      "PASTOR",
      "SENIOR",
      "TEST",
      "THIRD",
      "VOTER"
    )
  ) %>% 
  glue::glue(x = . , "\\b{x}\\b") %>%  # must be words
  glue::glue_collapse(sep = "|") # search for any

x <- d %>% 
  dplyr::mutate(
    match = 
      last_name %>% 
      stringr::str_to_upper() %>% 
      stringr::str_replace_all(pattern = "[^ A-Z]", replacement = " ") %>% 
      stringr::str_squish() %>% 
      stringr::str_extract(pattern = w_regexp)
  ) %>% 
  dplyr::filter(!is.na(match))

nrow(x)
[1] 124
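
The matching step can be reproduced on a toy example to see how it behaves. The sketch below is a base R approximation of the pipeline above, not the original code: `extract_special()` and the short word list are hypothetical, and it assumes no missing values. As far as can be told from the code shown, non-letters (including digits) are replaced by spaces before matching, so digit-bearing words such as "1ST" or "2" cannot match at this stage.

```r
# Small illustrative subset of the special words (assumption, not the full set)
words   <- c("JR", "SR", "II", "III", "1ST", "2")
pattern <- paste0("\\b", words, "\\b", collapse = "|")

# Hypothetical base R approximation of the cleaning and extraction steps
extract_special <- function(x) {
  cleaned <- gsub("[^ A-Z]", " ", toupper(x))   # non-letters become spaces
  cleaned <- trimws(gsub(" +", " ", cleaned))   # collapse repeated spaces
  m   <- regexpr(pattern, cleaned, perl = TRUE) # first match per value, -1 if none
  out <- rep(NA_character_, length(x))
  out[m > 0] <- regmatches(cleaned, m)
  out
}

extract_special(c("FILLINGHAM, II", "BINGHAM JR.", "SATTERFIELD 111", "SMITH 2", "JONES"))
```

Punctuation around a suffix does not prevent a match, but a suffix typed with digits (111) is missed here and has to be caught by the digit checks instead.
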
x %>% 
  dplyr::arrange(match, sex, last_name, first_name) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt match
60 MECKLENBURG 000001168096 WILLIAMSON DR IRVIN D NA 7306 NA NA GLENMONT DR NA NA CHARLOTTE NC 28227 NA NA M MALE 43 NC 2000-11-07 DR
60 MECKLENBURG 000001320093 I’ANSON-JACKSON JENNIFER NA NA 1820 NA NA INTERFACE LN NA 101 CHARLOTTE NC 28262 704 2818718 F FEMALE 37 NA 2004-10-01 I
60 MECKLENBURG 000001227377 BIREN II WILLIAM GEORGE NA 3511 NA NA LAKESIDE DR NA NA CHARLOTTE NC 28270 NA NA M MALE 24 NA 2002-10-11 II
60 MECKLENBURG 000001225864 CRITTENDON II WILLIAM BURRELL NA 2400 NA NA FOREST DR NA NA CHARLOTTE NC 28211 NA NA M MALE 22 NA 2002-10-04 II
72 PERQUIMANS 000000005410 EVANS II DONALD M NA 944 NA NA COUNTY LINE RD NA NA TYNER NC 27980 252 3562272 M MALE 45 VA 1996-10-08 II
59 MCDOWELL 000000037355 FILLINGHAM, II ROBERT E NA 0 NA NA HILLVIEW DR NA NA MARION NC 28752 828 6527120 M MALE 53 OH 1995-11-21 II
60 MECKLENBURG 000001167592 GOODWIN II PAUL J NA 2012 NA NA MALLARD PINE CT NA NA CHARLOTTE NC 28262 NA NA M MALE 28 NC 2000-11-07 II
45 HENDERSON 000000117385 GREEN II BILLY HOWARD NA 291 NA NA FAIRWAY FALLS RD NA NA ARDEN NC 28704 828 8911084 M MALE 57 MO 2005-08-01 II
11 BUNCOMBE 000000274878 METTS II CAREY MONTGOMERY NA 11 NA NA RYAN LN NA NA WEAVERVILLE NC 28787 NA NA M MALE 39 FL 2004-10-06 II
11 BUNCOMBE 000000141455 MILHORN II JOSEPH JAMES NA 112 NA NA CHURCHILL DOWNS DR NA NA FAIRVIEW NC 28730 828 6283444 M MALE 33 NA 1990-09-20 II
26 CUMBERLAND 000066020922 PERSON II DARYEL JAMES NA 4409 NA NA BISHAMON ST NA NA HOPE MILLS NC 28348 910 4299014 M MALE 26 FL 2004-06-17 II
60 MECKLENBURG 000001166422 SEABOLD II GERALD W NA 5908 NA NA AMITY PL NA NA CHARLOTTE NC 28212 NA NA M MALE 34 PA 2000-11-07 II
11 BUNCOMBE 000000199741 STANLEY II WILLIAM A NA 333 NA NA SOURWOOD DR NA NA BLACK MOUNTAIN NC 28711 828 6861100 M MALE 46 NA 1997-08-19 II
60 MECKLENBURG 000001167146 TAYLOR II ROBERT D NA 3629 NA NA MANOR HOUSE DR NA NA CHARLOTTE NC 28270 NA NA M MALE 34 SC 2000-11-21 II
23 CLEVELAND 000000072912 THOMBS II DANIEL EUGENE NA 702 NA NA STERLING DR NA NA KINGS MOUNTAIN NC 28086 NA NA M MALE 32 NC 1996-09-18 II
59 MCDOWELL 000000037122 WATSON II ROBERT NATHANIEL NA 404 NA W HENDERSON ST NA NA MARION NC 28752 NA NA M MALE 33 NC 1995-09-30 II
60 MECKLENBURG 000001323595 WORD II JOE NAHAN NA 3032 NA NA CRICKETEER DR NA NA CHARLOTTE NC 28216 704 3920489 M MALE 32 NC 2004-10-04 II
20 CHEROKEE 000000031017 AUSLEY III PRESTON ALEXANDER NA 11 NA NA PIERCY RD NA NA ANDREWS NC 28901 828 3215831 M MALE 59 FL 2005-04-11 III
60 MECKLENBURG 000001166616 BEATTY III CURTIS M NA 2200 NA NA PIMPERNEL RD NA NA CHARLOTTE NC 28213 NA NA M MALE 34 NC 2000-11-22 III
20 CHEROKEE 000000031032 BLACKWELDER III DWIGHT MCNAIRY NA 214 NA NA COOK BRIDGE RD NA NA MURPHY NC 28906 828 8370779 M MALE 57 OH 2005-04-05 III
20 CHEROKEE 000000030287 BOONE III JAMES HENRY NA 360 NA NA WOODS DR NA NA MURPHY NC 28906 828 8359046 M MALE 47 LA 2004-09-29 III
60 MECKLENBURG 000001227809 BOSQUEZ III RICHARD NA NA 624 NA NA DRURY DR NA NA CHARLOTTE NC 28206 NA NA M MALE 22 IN 2002-10-11 III
60 MECKLENBURG 000001166773 CHAPPELL III TRAVIS NA NA 12222 NA NA WICKSON CT NA NA HUNTERSVILLE NC 28078 NA NA M MALE 39 SC 2000-11-17 III
60 MECKLENBURG 000001227408 COCKERHAM III BOBBY LEE NA 701 NA NA CHEROKEE RD NA NA CHARLOTTE NC 28207 NA NA M MALE 22 NA 2002-10-11 III
60 MECKLENBURG 000001224596 CONNELL III THOMAS JOSEPH NA 912 NA NA BLACK OAK DR NA NA MATTHEWS NC 28105 NA NA M MALE 59 PA 2002-09-12 III
19 CHATHAM 000000046953 FAULKNER III HOWARD VERNON NA 2270 NA NA MANNS CHAPEL RD NA NA PITTSBORO NC 27312 919 9423588 M MALE 57 NA 1998-04-07 III
60 MECKLENBURG 000099840888 GOODWIN III WARD ALEXANDER NA 4134 NA NA ASHERTON DR NA NA CHARLOTTE NC 28226 704 5429315 M MALE 18 VA 2005-04-05 III
60 MECKLENBURG 000000957476 GROUSE III CHARLES J NA 3913 NA NA HUNTMEADOW DR NA NA CHARLOTTE NC 28269 NA NA M MALE 38 NY 1995-10-13 III
60 MECKLENBURG 000001166892 HARRIS III WILLIAM T NA 10530 NA NA PACES AVE NA 1533 MATTHEWS NC 28105 NA NA M MALE 45 CT 2000-11-16 III
60 MECKLENBURG 000001165381 KNOX III JOHN J NA 6530 NA NA BOYKIN SPANIEL RD NA NA CHARLOTTE NC 28277 NA NA M MALE 44 NA 2000-11-09 III
60 MECKLENBURG 000001177685 LANE III WILLIAM JAMES NA 6713 NA NA WILLIAM HARRY CT NA NA CHARLOTTE NC 28211 NA NA M MALE 58 MI 2001-05-16 III
20 CHEROKEE 000000031001 MCGUIRT III JAMES WILLIAM NA 568 NA NA STOCKADE MOUNTAIN RD NA NA MURPHY NC 28906 828 8379325 M MALE 62 FL 2005-03-29 III
60 MECKLENBURG 000001150953 MILLER III JOHNNIE H NA 5200 NA NA WILGROVE-MINT HILL RD NA 13 CHARLOTTE NC 28227 NA NA M MALE 25 NA 2000-10-05 III
60 MECKLENBURG 000001153905 MOORE III JAMES P NA 2825 NA NA YORKVIEW CT NA NA CHARLOTTE NC 28270 NA NA M MALE 41 MD 2000-10-09 III
60 MECKLENBURG 000001226141 NEWSOME III THOMAS LESLIE NA 17009 NA NA NORTHSTAR DR NA C HUNTERSVILLE NC 28078 NA NA M MALE 33 VA 2002-10-09 III
20 CHEROKEE 000000023294 PEACOCK III EDWARD JACKSON NA 88 NA NA UPPER RIDGE RD NA NA MARBLE NC 28905 828 8378331 M MALE 27 SC 1998-10-15 III
20 CHEROKEE 000000029803 PETERS III MARION HOWELL NA 615 NA NA HOLLY RD NA NA MURPHY NC 28906 828 6440273 M MALE 53 FL 2004-07-27 III
60 MECKLENBURG 000001223714 PRUDEN III THOMAS EUGENE NA 1255 NA NA SYDNEY DR NA NA CHARLOTTE NC 28270 NA NA M MALE 48 VA 2002-09-27 III
60 MECKLENBURG 000001150906 REDFEARN III WILBERT NA NA 1020 NA NA BELMONT AVE NA NA CHARLOTTE NC 28205 NA NA M MALE 23 NA 2000-10-09 III
60 MECKLENBURG 000001324017 SMITH III GUY R NA 1417 NA NA GRIERS GROVE RD NA NA CHARLOTTE NC 28216 704 3947608 M MALE 35 NA 2004-10-04 III
5 ASHE 000000016598 THOMPSON III EMERY NA NA 1160 NA NA WEAVER FORD RD NA NA GRASSY CREEK NC 28631 NA NA M MALE 67 NY 1997-05-06 III
87 SWAIN 000000007266 BAKER IIII WILLAIM RAINEY NA 150 NA NA BRYSON AVE NA NA BRYSON CITY NC 28713 NA NA M MALE 48 NA 1994-03-02 IIII
60 MECKLENBURG 000001165742 BUXTON IV SAMUEL R NA 4400 NA NA SIMSBURY RD NA NA CHARLOTTE NC 28226 NA NA M MALE 25 NC 2000-10-23 IV
60 MECKLENBURG 000001165921 LONG IV FLOYD M NA 8501 NA NA WOODFORD BRIDGE DR NA NA CHARLOTTE NC 28216 NA NA M MALE 25 NC 2000-10-30 IV
60 MECKLENBURG 000001165804 THOMPSON IV HARRY M NA 1120 NA NA WILLHAVEN DR NA NA CHARLOTTE NC 28211 NA NA M MALE 31 NC 2000-10-25 IV
20 CHEROKEE 000000030863 ANSELMENT JR JOSEPH LEONARD NA 1134 NA NA LONG BRANCH RD NA NA MARBLE NC 28905 828 8373669 M MALE 47 NY 2005-01-31 JR
11 BUNCOMBE 000000172434 BALL JR SAMUEL LEE NA 128 NA NA MERCHANT ST NA NA ASHEVILLE NC 28803 828 2992296 M MALE 35 NA 1994-12-07 JR
26 CUMBERLAND 000000057599 BARKLEY JR CHARLES W NA 6627 NA NA BRANNAN WAY NA NA FAYETTEVILLE NC 28314 NA NA M MALE 57 NJ 1994-08-01 JR
87 SWAIN 000000000426 BENDER JR JOHN JOHN P NA 0 NA NA US HWY 19 NA E NA BRYSON CITY NC 28713 NA NA M MALE 59 PA 1990-09-19 JR
5 ASHE 000000014095 BINGHAM JR. AMES EDMOND NA 0 NA NA ASHE PARK RD NA NA WEST JEFFERSON NC 28694 NA NA M MALE 32 CO 1994-07-26 JR
87 SWAIN 000000002548 BIRCHFIELD JR MILBURN JOEL NA 145 NA NA PINE ST NA NA BRYSON CITY NC 28713 NA NA M MALE 61 PA 1992-10-05 JR
20 CHEROKEE 000000031646 BLEDSOE JR HOMER BLAINE NA 445 NA NA HUGHES RD NA NA MURPHY NC 28906 828 8358947 M MALE 55 IN 2005-10-05 JR
60 MECKLENBURG 000001153550 BROWN JR ROBERT A NA 4947 NA NA CENTRAL AVE NA 7 CHARLOTTE NC 28205 NA NA M MALE 33 NY 2000-10-12 JR
60 MECKLENBURG 000001168114 BUNDESMAN JR BERNARD B NA 4327 NA NA CAMERON OAKS DR NA NA CHARLOTTE NC 28211 NA NA M MALE 58 GA 2000-11-07 JR
60 MECKLENBURG 000001165991 BYRD JR HERBERT L NA 16119 NA NA HOLLINGBOURNE RD NA NA HUNTERSVILLE NC 28078 NA NA M MALE 36 NC 2000-10-31 JR
56 MACON 000000039817 CAIL JR MALCOLM LEHOLMES NA 1769 NA NA OLD MURPHY RD NA NA FRANKLIN NC 28734 828 3496037 M MALE 54 GA 2004-09-20 JR
20 CHEROKEE 000000030877 CARRIER JR ROBERT WILSON NA 216 NA NA GRACE MOUNTAIN RD NA NA MURPHY NC 28906 828 8373320 M MALE 65 PA 2005-02-07 JR
20 CHEROKEE 000000031152 CHAMBERS JR KENNETH RAY NA 48 NA NA ANGEL WINGS DR NA NA MURPHY NC 28906 828 8354704 M MALE 25 GA 2005-05-24 JR
60 MECKLENBURG 000001166041 CHARLES JR WILLIE J NA 1401 NA NA KELSTON PL NA 201 CHARLOTTE NC 28212 NA NA M MALE 41 NC 2000-11-13 JR
32 DURHAM 000030080748 CLAY JR WILEY WALTON JR 117 NA NA KILT WAY NA NA DURHAM NC 27712 NA NA M MALE 46 NC 2004-10-08 JR
60 MECKLENBURG 000001165763 CLAYTON JR JAMES D NA 3126 NA NA MALLARD FOREST DR NA NA CHARLOTTE NC 28269 NA NA M MALE 36 RI 2000-10-24 JR
60 MECKLENBURG 000001168372 CULBRETH JR WALTER E NA 860 NA NA JETTON ST NA 70 DAVIDSON NC 28036 NA NA M MALE 73 NC 2000-12-04 JR
68 ORANGE 000000121958 DAYE JR. JAMES NA JR 501 NA NA LANCASTER RD NA NA MEBANE NC 27302 NA NA M MALE 31 NC 1992-11-04 JR
11 BUNCOMBE 000000179528 ENGLISH JR WARREN ROBERT NA 40 NA NA HAMBURG DR NA NA WEAVERVILLE NC 28787 828 6453500 M MALE 36 NA 1995-10-04 JR
32 DURHAM 000000691649 EVANS JR RALPH NA II 909 NA E CLUB BLVD NA NA DURHAM NC 27704 NA NA M MALE 56 NC 1997-03-11 JR
60 MECKLENBURG 000001167972 FAILLE JR EDWARD J NA 7836 NA NA ELLA JANE LN NA X CHARLOTTE NC 28273 NA NA M MALE 34 NJ 2000-11-07 JR
23 CLEVELAND 000000073378 FARMER JR BENJAMIN STEVE NA 107 NA NA KITTY CT NA NA GROVER NC 28073 NA NA M MALE 27 GA 1996-10-08 JR
60 MECKLENBURG 000001166866 FRAZIER JR JAMES A NA 7710 NA NA BABE STILLWELL FARM RD NA NA HUNTERSVILLE NC 28078 NA NA M MALE 39 NA 2000-11-15 JR
60 MECKLENBURG 000001167590 GARCIA JR FRANK NA NA 4900 NA NA CARMEL RD NA NA CHARLOTTE NC 28226 NA NA M MALE 44 TX 2000-11-07 JR
60 MECKLENBURG 000001167560 HALL JR JAMES B NA 1114 NA NA BELGRAVE PL NA NA CHARLOTTE NC 28203 NA NA M MALE 24 NC 2000-11-07 JR
60 MECKLENBURG 000001327517 HARDIN JR CHARLES ELMORE NA 1632 NA NA NEWLAND RD NA NA CHARLOTTE NC 28216 NA NA M MALE 53 NC 2004-10-06 JR
23 CLEVELAND 000000073147 HARGRAVES JR JAMES CALVIN NA 4032 NA NA LITTLEJOHN CIR NA NA SHELBY NC 28152 NA NA M MALE 58 VA 1996-10-02 JR
60 MECKLENBURG 000001325170 HARRIS JR CHAMP NA NA 511 NA NA HONEYWOOD AVE NA NA CHARLOTTE NC 28216 704 3946587 M MALE 34 NC 2004-09-23 JR
36 GASTON 000007563781 HAWKINS JR REED GREGORY NA 213 NA NA 6TH ST NA NA BELMONT NC 28012 NA NA M MALE 36 NC 1996-01-24 JR
11 BUNCOMBE 000000150829 HENSLEY JR LAWRENCE G NA 209 NA NA MORNINGSIDE DR NA NA ASHEVILLE NC 28806 828 2531379 M MALE 32 NA 1992-03-10 JR
20 CHEROKEE 000000031328 HERNDON JR EVERETT GEORGE NA 55 NA NA PUTT-PUTT LN NA NA MURPHY NC 28906 828 6445703 M MALE 59 MT 2005-07-25 JR
60 MECKLENBURG 000001167556 HILL JR JAMES C NA 2724 NA NA FAIRSTONE AVE NA NA CHARLOTTE NC 28269 NA NA M MALE 43 MI 2000-11-07 JR
60 MECKLENBURG 000001164772 HOYLE JR GEORGE A NA 5714 NA NA COPPER CREEK CT NA 16 CHARLOTTE NC 28227 704 6099491 M MALE 27 NC 2000-10-31 JR
60 MECKLENBURG 000001166447 HUMPHRIES JR DONNIE R NA 9934 NA NA HEATHERGATE LN NA NA CHARLOTTE NC 28227 NA NA M MALE 23 NC 2000-11-08 JR
60 MECKLENBURG 000001167417 KENNEDY JR THOMAS E NA 14440 NA NA CATHERINE MILLER DR NA NA CHARLOTTE NC 28273 NA NA M MALE 34 NC 2000-12-01 JR
23 CLEVELAND 000000072969 KUBU JR JERRY JOHN NA 731 NA NA BETHLEHEM CH RD NA NA KINGS MOUNTAIN NC 28086 NA NA M MALE 40 NJ 1996-09-30 JR
56 MACON 000000038890 LANE JR DAVID C NA 260 NA NA CALVARY DR NA NA OTTO NC 28763 828 3692704 M MALE 47 MD 2004-06-09 JR
26 CUMBERLAND 000000058698 LAWRENCE JR HARRY NA NA 7117 NA NA PORTO PL NA NA FAYETTEVILLE NC 28314 NA NA M MALE 53 SC 1994-08-02 JR
26 CUMBERLAND 000000053393 MARBLE JR ROBERT STERLING NA 7716 NA NA BORTHWICK DR NA NA FAYETTEVILLE NC 28306 NA NA M MALE 35 NA 1994-06-17 JR
60 MECKLENBURG 000001165676 MCCLURE JR DONALD R NA 408 NA NA COLONY ACRES DR NA NA CHARLOTTE NC 28217 NA NA M MALE 27 NC 2000-10-23 JR
60 MECKLENBURG 000001166516 MCGUIRE JR JOHN M NA 226 NA S TORRENCE ST NA 103 CHARLOTTE NC 28204 NA NA M MALE 35 OK 2000-11-15 JR
87 SWAIN 000000006773 MONGIOVI JR ANTHONY B NA 200 NA NA PARK CIR NA NA CHEROKEE NC 28719 NA NA M MALE 33 NJ 1992-09-22 JR
20 CHEROKEE 000000030947 MOORE JR HARRY GRADY NA 254 NA NA REVIS RD NA NA ANDREWS NC 28901 828 3212463 M MALE 62 TN 2005-02-22 JR
60 MECKLENBURG 000001336806 MORRISON JR WILLIAM EMERSON NA 6311 NA NA BENTRIDGE DR NA NA CHARLOTTE NC 28226 NA NA M MALE 51 NC 2004-10-07 JR
25 CRAVEN 000000095913 MOSES JR MICHAEL WILLIAM NA 1511 NA N CRAVEN ST NA NA NEW BERN NC 28560 252 5142949 M MALE 20 MD 2005-03-14 JR
60 MECKLENBURG 000001277848 NASIFE JR SAMUEL NICHOLAS NA 17604 NA NA CAMBRIDGE GROVE DR NA NA HUNTERSVILLE NC 28078 704 6346936 M MALE 33 NJ 2004-05-19 JR
60 MECKLENBURG 000001336886 OUTLAND JR HOWARD BROWN NA 5830 NA NA COUGAR LN NA NA CHARLOTTE NC 28269 704 9211855 M MALE 47 VA 2004-09-27 JR
60 MECKLENBURG 000001336057 OVERTON JR ROBERT ALLEN NA 9201 NA NA UNIVERSITY CITY BLVD NA 308 CHARLOTTE NC 28213 919 4172494 M MALE 22 VA 2004-10-05 JR
20 CHEROKEE 000000030815 PARKS JR JOEL TIMOTHY NA 520 NA NA BRASSTOWN RD NA NA MURPHY NC 28906 828 8357099 M MALE 19 SC 2004-11-08 JR
43 HARNETT 000000826344 PULSIFER JR HAROLD WINFRED NA 115 NA NA OLD POST RD NA NA ERWIN NC 28339 910 8923500 M MALE 66 ME 2004-03-23 JR
11 BUNCOMBE 000000145771 REED JR BRUCE HAL NA 1011 NA NA RICEVILLE RD NA NA ASHEVILLE NC 28805 828 2547244 M MALE 43 NA 1990-11-07 JR
11 BUNCOMBE 000000139825 ROBERTS JR GEORGE MARION NA 342 NA NA DONNYBROOK DR NA NA ASHEVILLE NC 28806 828 6652845 M MALE 74 NA 1990-08-10 JR
59 MCDOWELL 000000037352 RUSSELL, JR. KERMITT PATRICK NA 1008 NA NA GOOSE CREEK RD NA NA MARION NC 28752 828 7383705 M MALE 36 NC 1995-11-13 JR
11 BUNCOMBE 000000149836 SHADE JR EVERETTE LEE NA 0 NA NA ROCKWOOD RD NA NA ARDEN NC 28704 828 6848057 M MALE 68 NA 1992-02-21 JR
60 MECKLENBURG 000000957414 SHEALLY JR WILLIAM B NA 5126 NA NA COBURN CT NA NA CHARLOTTE NC 28277 NA NA M MALE 55 VA 1995-10-13 JR
68 ORANGE 000000017420 ST JEAN JR JOSEPH NA NA 610 NA NA HILLSBOROUGH RD NA NA CARRBORO NC 27510 NA NA M MALE 82 WA 1972-10-07 JR
60 MECKLENBURG 000001166611 STANSBERRY JR DAVID R NA 5554 NA NA OAK DR NA NA CHARLOTTE NC 28216 NA NA M MALE 25 NC 2000-11-22 JR
32 DURHAM 000030018716 STREETER JR THOMAS EARL NA 5412 NA NA KEMMONT DR NA NA DURHAM NC 27713 NA NA M MALE 24 NC 2000-10-05 JR
87 SWAIN 000000006713 VAN DOREN JR EDWARD FOSTER NA 41 NA NA US HWY 19 NA W NA BRYSON CITY NC 28713 NA NA M MALE 44 NY 1992-07-13 JR
20 CHEROKEE 000000029842 WHITEHOUSE JR JOHN JOSEPH NA 223 NA NA SEQUOYAH LN NA NA MURPHY NC 28906 828 8370472 M MALE 36 MD 2004-08-10 JR
87 SWAIN 000000006783 WHITFIELD JR RAYMOND E NA 200 NA NA PARK CIR NA NA CHEROKEE NC 28719 NA NA M MALE 32 NC 1992-09-22 JR
22 CLAY 000000014230 WIEGOLD JR RICHARD MARTIN NA 39 NA NA FAIRWAY MEADOWS DR NA NA HAYESVILLE NC 28904 828 3890624 M MALE 40 IL 2005-08-03 JR
60 MECKLENBURG 000001152500 WILLIAMSON JR SOLOMAN J NA 8351 NA NA KNIGHTS BRIDGE RD NA NA CHARLOTTE NC 28210 NA NA M MALE 33 NA 2000-10-12 JR
47 HOKE 000000025805 YOAKUM JR JC NA NA 221 NA NA SPRINGER DR NA NA RAEFORD NC 28376 910 9041330 M MALE 41 TN 2004-03-23 JR
91 VANCE 000000105186 SMITH MD PATRICIA ANN NA 2873 NA NA SATTERWHITE POINT RD NA NA HENDERSON NC 27537 252 4310096 F FEMALE 40 PA 2005-09-07 MD
92 WAKE 000009851521 VAN NAME MARY A NA 10024 NA NA SYCAMORE RD NA NA RALEIGH NC 27613 NA NA F FEMALE 50 NY 1980-01-01 NAME
80 ROWAN 000000096581 VAN NAME NANCY HIGGINS NA 1105 NA NA HEATHERDALE RD NA NA SALISBURY NC 28146 NA NA F FEMALE 49 IL 2001-08-09 NAME
68 ORANGE 000000177132 VAN NAME CHRISTOPHER PAUL NA 207 NA NA GARDNER CIR NA NA CHAPEL HILL NC 27516 NA NA M MALE 50 NY 1999-09-20 NAME
10 BRUNSWICK 000001444099 VAN NAME GARY GEORGE NA 1000 NA NA MOREHEAD RD NA NA BOILING SPRING LAKES NC 28461 NA NA M MALE 48 NJ 2001-06-08 NAME
92 WAKE 000009853513 VAN NAME MARK L NA 10024 NA NA SYCAMORE RD NA NA RALEIGH NC 27613 NA NA M MALE 50 MA 1980-01-01 NAME
92 WAKE 000031249480 BRAKE SR ESS CAROLYN G NA 4238 NA NA KAPLAN DR NA NA RALEIGH NC 27606 NA NA F FEMALE 50 NY 2004-07-29 SR
18 CATAWBA 000030027272 DOSS SR MICHAEL RAY NA 818 NA NA 2ND ST PL NE 70 HICKORY NC 28601 828 3129864 M MALE 45 SC 2004-06-01 SR
11 BUNCOMBE 000000144220 HICKS SR WILFORD LYTLE SR. 211 NA NA BRICKYARD RD NA NA ASHEVILLE NC 28806 828 2549203 M MALE 87 NA 1990-10-08 SR
23 CLEVELAND 000000072903 STIMSON SR RICHARD BARRETT NA 110 NA NA CARLIN DR NA NA KINGS MOUNTAIN NC 28086 NA NA M MALE 60 VA 1996-09-24 SR
60 MECKLENBURG 000001165089 VAUGHN SR WALTER S NA 3630 NA NA GRIFFITH ST NA NA CHARLOTTE NC 28203 NA NA M MALE 78 NA 2000-10-26 SR
23 CLEVELAND 000000073301 WHITWORTH SR RANDY SEAN NA 4005 NA NA HILLVIEW CIR NA NA SHELBY NC 28152 NA NA M MALE 30 NC 1996-09-17 SR
87 SWAIN 000000058394 V’SOSKE ERIKA DONNELL NA 13077 NA W HWY 19 HWY NA NA BRYSON CITY NC 28713 828 4886954 F FEMALE 26 OH 2003-04-02 V
57 MADISON 000000007889 MOODY V WILLIE HOLMES NA 385 NA NA PINEY GROVE RD NA NA MARSHALL NC 28753 828 6830928 M MALE 26 FL 1997-07-24 V
60 MECKLENBURG 000001166056 TENNENT V EDWARD S NA 1414 NA NA HAMORTON PL NA NA CHARLOTTE NC 28205 NA NA M MALE 31 NC 2000-11-14 V

I eyeballed the results and dropped from the search list the words that appeared to be used validly in most cases, that is, as genuine parts of a name.

Invalid words:

  • As whole field: (none)
  • As first word: (none)
  • As last word: DR, II, III, IIII, IV, JR, MD, SR
  • As internal word: SR
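
The position categories in the list above (whole field, first word, last word, internal word) could be assigned programmatically rather than by eye. The following is a minimal sketch only, not the code used to produce these results: the helper name word_position() is hypothetical, the word list is copied from the "As last word" entry above, and base R is used in place of stringr so that the example is self-contained. It applies the same normalisation as the analysis code (upper-case, non-letters to spaces, squeeze whitespace) and reports the position of the first matching word.

```r
# Words judged invalid in last_name (copied from the list above)
invalid_words <- c("DR", "II", "III", "IIII", "IV", "JR", "MD", "SR")

# Hypothetical helper: where does the first invalid word sit in a name field?
# Returns NA if the field contains none of the words.
word_position <- function(name, words = invalid_words) {
  clean <- toupper(name)                    # upper-case
  clean <- gsub("[^ A-Z]", " ", clean)      # non-letters become spaces
  clean <- trimws(gsub(" +", " ", clean))   # squeeze and trim whitespace
  tokens <- strsplit(clean, " ", fixed = TRUE)[[1]]

  hit <- which(tokens %in% words)
  if (length(hit) == 0) {
    return(NA_character_)
  }

  i <- hit[1]            # first matching word
  n <- length(tokens)
  if (n == 1) {
    "whole field"
  } else if (i == 1) {
    "first word"
  } else if (i == n) {
    "last word"
  } else {
    "internal word"
  }
}

# Apply to a few last_name values taken from the table above
examples <- c("MOORE JR", "RUSSELL, JR.", "BRAKE SR ESS", "VAN NAME")
vapply(examples, word_position, character(1))
```

Under these assumptions, "MOORE JR" and "RUSSELL, JR." are classed as last-word matches, "BRAKE SR ESS" as an internal-word match, and "VAN NAME" as no match, because NAME is not in the invalid list. A real implementation would probably be vectorised with dplyr::mutate() in the style of the surrounding code.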

5.4.2 First name

# regular expression to match words
w_regexp <- 
  c(w_hons, w_gen, w_spec, w_test) %>% # all special words
  unique() %>% # make it a set
  dplyr::setdiff( # remove words that appear to mostly be validly used
    c(
      "BISHOP",
      "BROTHER",
      "DOCTOR",
      "ELDER",
      "JUNIOR",
      "MASTER",
      "MISTER",
      "PASTOR",
      "PROFESSOR"
    )
  ) %>% 
  glue::glue(x = . , "\\b{x}\\b") %>%  # wrap each word in word boundaries so only whole words match
  glue::glue_collapse(sep = "|") # join with "|" so the pattern matches any one of the words

x <- d %>% 
  dplyr::mutate(
    match = 
      first_name %>% 
      stringr::str_to_upper() %>% 
      stringr::str_replace_all(pattern = "[^ A-Z]", replacement = " ") %>% 
      stringr::str_squish() %>% 
      stringr::str_extract(pattern = w_regexp)
  ) %>% 
  dplyr::filter(!is.na(match))

nrow(x)
[1] 328
x %>% 
  dplyr::arrange(match, sex, last_name, first_name) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt match
92 WAKE 000009826227 AAL-ANUBIAIMHOTE DR NGOZI NA NA 1113 NA NA CARLISLE ST NA NA RALEIGH NC 27610 NA NA F FEMALE 55 NY 1978-01-01 DR
20 CHEROKEE 000000011561 WILMES FATHER JAMES NA 18 NA NA UNKNOWN NA NA NA ANDREWS NC 28901 NA NA M MALE 78 IL 1986-09-10 FATHER
60 MECKLENBURG 000000281797 ARMOUR I ELISABETH NA NA 10008 NA NA PROSPERITY POINT LN NA NA CHARLOTTE NC 28269 NA NA F FEMALE 55 NA 1976-10-01 I
60 MECKLENBURG 000001144364 BOYKIN-PRIDE I’MAN BRIANN NA 5726 NA NA OSPREY WATCH CT NA NA CHARLOTTE NC 28227 NA NA F FEMALE 24 CA 2000-09-20 I
78 ROBESON 000000457303 BRADLEY I-ASIA VICTORIA-CHERIS NA 612 NA NA KING ST NA NA LUMBERTON NC 28358 910 6189509 F FEMALE 33 NY 2004-09-08 I
34 FORSYTH 000030012392 BRITTON I-CHI GUO NA 1306 NA NA COLSTON CT NA NA KERNERSVILLE NC 27284 NA NA F FEMALE 46 OC 2000-06-27 I
60 MECKLENBURG 000001274325 BROOME I SONYA TIERRA NA 4508 NA NA SPRINGVIEW RD NA NA CHARLOTTE NC 28213 704 5995957 F FEMALE 19 NA 2004-05-14 I
32 DURHAM 000000698151 BULLARD NANCY I W NA 8228 NA NA WAKE FOREST RD NA NA DURHAM NC 27703 NA NA F FEMALE 50 NC 1998-02-11 I
34 FORSYTH 000001258800 CARLYLE I E NA 101 NA NA BELLE VISTA CT NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 105 NC 1947-04-11 I
13 CABARRUS 000000107838 CARTER I JEANNETTE GUILBE NA 3135 NA NA PATRICK HENRY DR NW I CONCORD NC 28027 NA NA F FEMALE 29 PR 1996-08-26 I
34 FORSYTH 000030024163 CHANG I-WEN NA NA 5232 NA NA OLD PLANTATION CIR NA NA WINSTON SALEM NC 27104 NA NA F FEMALE 32 OC 2000-11-15 I
60 MECKLENBURG 000001320186 COLEMAN JANA’I D NA 7316 NA NA PEBBLESTONE DR NA A CHARLOTTE NC 28212 704 5313563 F FEMALE 38 OC 2004-10-01 I
70 PASQUOTANK 000002791790 CSEH MON I WANG NA 602 NA NA TERRY ST NA NA ELIZABETH CITY NC 27909 252 3341591 F FEMALE 35 OC 2004-10-13 I
18 CATAWBA 000009025908 DESROSIERS I DARLENE NA 200 NA NA 25TH AVE NW NA HICKORY NC 28601 NA NA F FEMALE 69 NA 1984-09-14 I
92 WAKE 000010395123 DOSHI I NA NA 105 NA NA ECHOWOOD LN NA NA CARY NC 27511 NA NA F FEMALE 52 OC 1996-10-11 I
60 MECKLENBURG 000001183759 ERVIN I MEI CHOU NA 301 NA NA SETTLERS LN NA NA CHARLOTTE NC 28202 NA NA F FEMALE 34 NA 2001-08-14 I
60 MECKLENBURG 000000714275 GAYE I COLEEN M NA 15007 NA NA JUNE WASHAM RD NA NA DAVIDSON NC 28036 NA NA F FEMALE 47 NA 1989-11-01 I
67 ONSLOW 000000025430 GLASPIE I CHARLOTTE NA 1055 NA NA ROCKY RUN RD NA NA MIDWAY PARK NC 28544 NA NA F FEMALE 73 NC 1966-10-22 I
34 FORSYTH 000003105000 GREEN I A NA 113 NA NA MCKINLEY DR NA NA WINSTON SALEM NC 27107 NA NA F FEMALE 105 NC 1949-05-02 I
60 MECKLENBURG 000000951741 HALL I’MESHA L NA 700 NA NA GENTLE BREEZE DR NA NA CHARLOTTE NC 28273 NA NA F FEMALE 28 NC 1995-09-11 I
88 TRANSYLVANIA 000000029487 HEYWARD I CARTER NA 31 NA NA SEE OFF MTN RD NA NA BREVARD NC 28712 828 8849264 F FEMALE 60 NC 1996-09-07 I
32 DURHAM 000000696154 HU EDNA I JEN NA 2620 NA NA MCDOWELL RD NA NA DURHAM NC 27705 NA NA F FEMALE 42 OC 1997-10-18 I
84 STANLY 000000019484 HUNEYCUTT I SUZANNE EUDY NA 12304 NA NA ST MARTIN RD NA NA OAKBORO NC 28129 NA NA F FEMALE 53 NA 1977-07-20 I
92 WAKE 000031122791 JAN I-RAN HO NA 127 NA NA MALDON DR NA NA CARY NC 27513 NA NA F FEMALE 46 OC 2000-10-27 I
92 WAKE 000010062460 KUEHR-MCLAREN I WENDY NA 205 NA S MASON ST NA NA APEX NC 27502 NA NA F FEMALE 42 PA 1988-01-01 I
85 STOKES 000000008519 LANE I E MRS NA 1080 NA NA BLACKBURN RD NA NA PINNACLE NC 27043 NA 3682955 F FEMALE 96 NC 1969-07-26 I
26 CUMBERLAND 000066022571 LEWIS LEISA I OITERONG NA 5302 NA NA RODWELL RD NA NA FAYETTEVILLE NC 28311 910 8225458 F FEMALE 29 NA 2004-07-12 I
34 FORSYTH 000005276400 MARTIN I MARY NA 2117 NA E TWELFTH ST NA NA WINSTON SALEM NC 27101 NA NA F FEMALE 105 NC 1955-04-23 I
92 WAKE 000031080648 MENG CHENG-I C NA 205 NA NA CRESTVIEW RD NA NA RALEIGH NC 27609 NA NA F FEMALE 43 OC 2000-02-16 I
74 PITT 000030042470 MOORING I’RISHA ORCHA’ NA 3503 NA N MEMORIAL DR NA NA GREENVILLE NC 27834 NA NA F FEMALE 19 NC 2004-11-29 I
60 MECKLENBURG 000000664068 MORRIS I LANE NA NA 4801 NA NA MURRAYHILL RD NA NA CHARLOTTE NC 28209 NA NA F FEMALE 38 NC 1990-09-01 I
36 GASTON 000007581263 MULLIS LISA I BELL NA 420 NA NA WALKER DR NA NA LINCOLNTON NC 28092 NA NA F FEMALE 35 NC 1997-12-15 I
34 FORSYTH 000006029760 NEAL I M NA 227 NA NA FOXCROFT DR NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 105 NC 1960-05-10 I
43 HARNETT 000000828554 PERRY I SUN NA 78 NA NA SPIRAL BRANCH CT NA NA LINDEN NC 28356 910 8932337 F FEMALE 51 OC 2004-08-24 I
91 VANCE 000000103472 POPE I-ASIA COX NA 139 NA NA SPRINGWOOD DR NA NA HENDERSON NC 27537 252 4337757 F FEMALE 28 NC 2004-10-06 I
85 STOKES 000000014711 ROSS I G MRS NA 824 NA NA SUMMIT ST NA NA WALNUT COVE NC 27052 NA NA F FEMALE 96 NC 1969-10-26 I
32 DURHAM 000000541545 SAUNDERS VICKI I SUTTON NA 59 NA S WYCLIFF DR NA NA DURHAM NC 27703 NA NA F FEMALE 46 NC 1982-06-01 I
41 GUILFORD 000000633328 SHERWOOD I-LI BETH NA 2929 NA NA COTTAGE PL NA B GREENSBORO NC 27455 NA NA F FEMALE 23 DC 2004-10-08 I
60 MECKLENBURG 000001140673 SIMMONS I-EESHA D NA 5227 NA NA MAGNOLIA TREE LN NA NA CHARLOTTE NC 28215 704 4171852 F FEMALE 29 NC 2000-07-14 I
32 DURHAM 000030006504 SIU I-MEI NA NA 5803 NA NA TATTERSALL DR NA 16 DURHAM NC 27713 NA NA F FEMALE 34 DC 2000-02-04 I
31 DUPLIN 000000022759 SOUTHERLAND I KATHLEEN NA 1665 NA S NC 111 HWY NA NA CHINQUAPIN NC 28521 910 2983744 F FEMALE 70 NC 1988-03-10 I
45 HENDERSON 000000094726 SUMMEY I V NA 375 NA NA ZEB CORN RD NA NA HENDERSONVILLE NC 28792 828 6922989 F FEMALE 71 NC 2000-10-05 I
32 DURHAM 000000613135 SUTARIA DEBORAH I S NA 6 NA NA BLAKE CT NA NA DURHAM NC 27712 NA NA F FEMALE 53 NC 1990-09-26 I
92 WAKE 000031203388 TAI CHIH-I NA NA 6811 NA NA MIDDLEBORO DR NA NA RALEIGH NC 27612 NA NA F FEMALE 31 OC 2003-06-30 I
85 STOKES 000000017313 TUTTLE I BESSIE NA 3931 NA NA NC 89 HWY E NA NA NA WALNUT COVE NC 27052 NA NA F FEMALE 79 NC 1963-08-09 I
79 ROCKINGHAM 000000021873 WASHINGTON CAROLINE I HALEY NA 625 NA NA WINSTEAD ST NA NA REIDSVILLE NC 27320 336 3497968 F FEMALE 85 VA 1968-03-30 I
26 CUMBERLAND 000066047991 WEIR I SUN NA 3102 NA NA JOYE CIR NA NA SPRING LAKE NC 28390 910 4946953 F FEMALE 37 OC 2005-09-01 I
23 CLEVELAND 000001000649 WILEY I’AIESHA SHANTEA NA 909 NA NA FOXCROFT CIR NA APT-A SHELBY NC 28152 NA NA F FEMALE 23 NC 2001-11-15 I
51 JOHNSTON 000000290760 WOOD I F JR 1268 NA NA NC 242 HWY S NA BENSON NC 27504 919 8944028 F FEMALE 79 NA 1978-03-10 I
43 HARNETT 000000000866 ARNOLD I B NA 4395 NA NA CHRISTIAN LIGHT RD NA NA FUQUAY VARINA NC 27526 919 5524768 M MALE 88 NA 1966-10-17 I
60 MECKLENBURG 000001153427 BATES I C NA 8817 NA NA HATTON CT NA NA CHARLOTTE NC 28277 NA NA M MALE 42 NA 2000-10-11 I
78 ROBESON 000000300156 BREWER I V NA 4459 NA NA RENNERT RD NA NA SHANNON NC 28386 910 7382357 M MALE 69 NA 1982-03-08 I
92 WAKE 000010119817 CALDWELL I M NA 905 NA NA LAURENS WAY NA NA KNIGHTDALE NC 27545 NA NA M MALE 72 TX 1990-01-01 I
92 WAKE 000031220664 CHEO I-DEH NA NA 1800 NA NA CLEAN CT NA NA RALEIGH NC 27603 NA NA M MALE 66 OC 2004-01-12 I
65 NEW HANOVER 000000166698 CLARKE I MITCHELL NA 542 NA NA FOURTH AVE S NA KURE BEACH NC 28449 910 4588207 M MALE 40 NY 1998-06-22 I
20 CHEROKEE 000000019073 COLLEY I D NA 199 NA NA MOUNTAIN VIEW TRL NA NA MARBLE NC 28905 828 8371025 M MALE 72 FL 1996-06-20 I
14 CALDWELL 000014475961 DOWNS I V NA NA 4114 NA NA KENDELL PL NA NA HUDSON NC 28638 828 3962264 M MALE 82 NC 2003-10-15 I
74 PITT 000000007206 EDWARDS I J JR 208 NA NA WINDSOR RD NA NA GREENVILLE NC 27858 NA NA M MALE 81 NC 1968-04-18 I
19 CHATHAM 000000005824 FERGUSON I M JR 1206 NA NA JESSIE BRIDGE RD NA NA SILER CITY NC 27344 919 7423836 M MALE 79 NA 1972-04-01 I
32 DURHAM 000000676083 FU I-KONG BATOR NA 3 NA NA ORCHID CT NA NA DURHAM NC 27713 NA NA M MALE 36 TN 1996-05-16 I
34 FORSYTH 000003029640 GORDON I BRYCE NA 3456 NA NA LAKEVIEW DR NA NA PFAFFTOWN NC 27040 NA NA M MALE 105 NC 1950-05-13 I
63 MOORE 000000008195 GUNTER I W NA 2116 NA NA LAKEBAY RD NA NA VASS NC 28394 910 2454176 M MALE 80 NC 1968-10-05 I
32 DURHAM 000000696405 HAIG I REID S NA 305 NA NA NOTTINGHAM DR NA NA CHAPEL HILL NC 27517 NA NA M MALE 69 DC 1997-11-12 I
60 MECKLENBURG 000000477066 HICKS I FAISON NA NA 3462 NA NA SHARON RD NA NA CHARLOTTE NC 28211 NA NA M MALE 51 NC 1984-04-01 I
54 LENOIR 000000413938 HINES I ALAN NA 1821 NA NA NEUSE RD NA NA KINSTON NC 28501 252 9391102 M MALE 56 NC 2003-05-02 I
96 WAYNE 000000015543 HOOD I G NA 1604 NA NA PALM ST NA NA GOLDSBORO NC 27530 NA NA M MALE 83 NA 1956-10-20 I
28 DARE 000000050745 HOWARD I CLARENCE NA 120 NA NA JAY CREST RD NA NA KITTY HAWK NC 27949 NA NA M MALE 80 NA 1991-01-10 I
67 ONSLOW 000000078216 JENKINS I D III 407 NA NA DAWN LN NA NA JACKSONVILLE NC 28540 NA 4557443 M MALE 25 CA 1998-06-11 I
97 WILKES 000000016413 JOHNSON I M NA 291 NA NA MT VIEW CHURCH RD NA NA NORTH WILKESBORO NC 28659 NA NA M MALE 64 NC 1978-10-09 I
80 ROWAN 000000018539 JOHNSTON I C NA 1907 NA NA MOOSE RD NA NA KANNAPOLIS NC 28083 NA NA M MALE 75 NA 1968-04-20 I
50 JACKSON 000000002698 KELLY I PERRY NA 102 NA NA TEABERRY RD NA NA CULLOWHEE NC 28723 828 2931086 M MALE 80 FL 1968-10-12 I
65 NEW HANOVER 000000151272 KINLAW I W NA 952 NA NA BIRCH CREEK DR NA NA WILMINGTON NC 28403 NA 6692575 M MALE 60 NC 1996-09-18 I
92 WAKE 000004865240 LAKE I BEVERLY JR 3703 NA NA SHADYBROOK DR NA NA RALEIGH NC 27609 NA NA M MALE 71 NC 1956-01-01 I
80 ROWAN 000000011606 LITTLE I MAYO JR 929 NA NA MOCKSVILLE AVE NA NA SALISBURY NC 28144 NA NA M MALE 77 NA 1983-02-10 I
92 WAKE 000009812815 LONGMUIR I S NA 2408 NA NA TYSON ST NA NA RALEIGH NC 27612 NA NA M MALE 83 OC 1976-01-01 I
16 CARTERET 000000060960 LYONS I CHARLES NA 660 NA NA BEECH CT NA NA NEWPORT NC 28570 252 3938234 M MALE 41 NC 2000-10-19 I
41 GUILFORD 000000132252 MANESS I M NA 1413 NA NA WISE AVE NA NA HIGH POINT NC 27260 NA NA M MALE 204 NC 1949-04-14 I
26 CUMBERLAND 000000030700 MCNEIL I J NA 821 NA NA OAKCREST DR NA NA FAYETTEVILLE NC 28301 NA NA M MALE 63 NA 1900-01-01 I
11 BUNCOMBE 000000079172 MILLER I J NA 1523 NA NA OLD NC HWY 20 HWY NA NA LEICESTER NC 28748 828 6833294 M MALE 70 NA 1980-11-04 I
18 CATAWBA 000009094902 MILLER I D MCGILVRAY NA 1149 NA NA CATAWBA VALLEY BLVD SE NA HICKORY NC 28602 NA NA M MALE 26 NA 1998-11-18 I
92 WAKE 000010035687 PALMER I JEREMIAH NA 6909 NA NA ELECTRA DR NA NA RALEIGH NC 27607 NA NA M MALE 60 NY 1984-01-01 I
23 CLEVELAND 000000024412 PATTERSON I EUGENE NA 403 NA NA PATTERSON RD NA NA KINGS MOUNTAIN NC 28086 NA NA M MALE 88 NC 1964-05-16 I
7 BEAUFORT 000000016657 PAUL I B NA 111 NA N WEDGEWOOD DR NA NA WASHINGTON NC 27889 NA NA M MALE 62 NC 1965-10-23 I
90 UNION 000000024672 PLYLER I F JR 2148 NA NA SKYWAY DR NA NA MONROE NC 28110 NA NA M MALE 82 NA 1963-12-13 I
80 ROWAN 000000039146 POPE I H JR 10 NA NA MULBERRY CIR NA NA SALISBURY NC 28144 NA NA M MALE 65 NA 1968-04-06 I
92 WAKE 000006853705 POWELL I HILL NA 410 NA NA LYNN RD NA NA RALEIGH NC 27609 NA NA M MALE 62 NJ 1968-01-01 I
32 DURHAM 000030084575 POWELL STEVEN I VANROOY NA 816 NA NA GRANDVIEW DR NA NA DURHAM NC 27703 NA NA M MALE 44 DC 2005-03-09 I
31 DUPLIN 000000013605 QUINN I J NA 139 NA NA CANDLEWOOD DR NA NA WALLACE NC 28466 NA 2933524 M MALE 73 NA 1968-01-01 I
31 DUPLIN 000000013606 QUINN I J JR 406 NA NA FOX RIDGE RD NA NA WARSAW NC 28398 NA NA M MALE 43 NC 1980-10-06 I
32 DURHAM 000030042360 RUSS I V NA 820 NA NA MARTIN LUTHER KING PKWY NA 141 DURHAM NC 27713 NA NA M MALE 56 NC 2002-12-09 I
30 DAVIE 000000013844 SMITH I BRUCE NA 124 NA S BERMUDA RUN DR NA NA ADVANCE NC 27006 NA NA M MALE 84 NA 1978-10-09 I
29 DAVIDSON 000000047171 SMITH I MELVIN NA 1174 NA NA HEDGECOCK RD NA NA HIGH POINT NC 27265 NA NA M MALE 54 NA 1984-04-09 I
34 FORSYTH 000007744440 SOLOMON I S NA 4000 NA NA LELAND DR NA NA WINSTON SALEM NC 27106 NA NA M MALE 105 NC 1960-10-17 I
78 ROBESON 000000140750 STONE I L NA 132 NA NA WARWICK MILL RD NA NA LUMBERTON NC 28358 NA NA M MALE 76 NA 1965-05-04 I
60 MECKLENBURG 000001213270 TERRY I B III 14635 NA NA WATERSIDE DR NA NA CHARLOTTE NC 28278 NA NA M MALE 31 VA 2002-05-17 I
18 CATAWBA 000007887750 TRAVIS I A NA 2067 NA NA TRAVIS RD SE NA CONOVER NC 28613 NA NA M MALE 75 NA 1968-10-05 I
92 WAKE 000010240161 WAKEFIELD I NELSON NA 107 NA NA ODESSA CIR NA NA CARY NC 27513 NA NA M MALE 55 VA 1994-01-01 I
60 MECKLENBURG 000000566491 WALLACE I J NA 9916 NA NA HORTON RD NA NA CHARLOTTE NC 28278 NA NA M MALE 80 NA 1988-09-01 I
94 WASHINGTON 000000003845 WARREN I NA NA 208 NA NA ROBIN DR NA NA PLYMOUTH NC 27962 NA NA M MALE 95 NC 1964-05-09 I
60 MECKLENBURG 000001122551 WU I-CHAN JOHN NA 4711 NA NA TITLEIST DR NA NA CHARLOTTE NC 28277 NA NA M MALE 26 NA 2000-02-14 I
26 CUMBERLAND 000000040494 MANUEL WALTER III NA NA 825 NA NA SHORTRIDGE RD NA NA FAYETTEVILLE NC 28303 NA NA M MALE 57 NA 1900-01-01 III
26 CUMBERLAND 000000009547 MCPHERSON VAN III NA NA 5136 NA NA HEWITT DR NA NA FAYETTEVILLE NC 28311 NA NA M MALE 36 NA 1900-01-01 III
15 CAMDEN 000000004209 NASH SAMUEL III NA NA 960 NA NA NC HWY 343 NA N NA CAMDEN NC 27921 NA NA M MALE 62 NA 1995-12-01 III
26 CUMBERLAND 000000022512 PATALANO LOUIS III NA NA 1536 NA NA DOVER CT NA NA FAYETTEVILLE NC 28304 NA NA M MALE 58 NA 1900-01-01 III
26 CUMBERLAND 000000031817 SCOTT CALVIN III NA NA 6674 NA NA FAIRCLOTH BRIDGE RD NA NA STEDMAN NC 28391 NA NA M MALE 42 NA 1900-01-01 III
92 WAKE 000010209858 SILVER III HAYDEN NA 1025 NA NA HARVEY ST NA NA RALEIGH NC 27608 NA NA M MALE 49 OK 1992-01-01 III
5 ASHE 000000014410 COPELAND IV EDWARD JAMES NA 986 NA NA BARE CREEK ACCESS RD NA NA JEFFERSON NC 28640 NA NA M MALE 30 NC 1994-10-17 IV
26 CUMBERLAND 000000074701 ANDERSON ELBERT JR NA NA 2523 NA NA BEL AIRE ST NA NA FAYETTEVILLE NC 28306 NA NA M MALE 71 NA 1991-11-21 JR
26 CUMBERLAND 000000010027 BARNEY LEO JR NA NA 448 NA NA HORSESHOE RD NA NA FAYETTEVILLE NC 28303 NA NA M MALE 39 NC 1987-10-30 JR
26 CUMBERLAND 000000040293 BOWLES ROBERT JR NA NA 1721 NA NA DAISY LN NA NA FAYETTEVILLE NC 28303 NA NA M MALE 82 IN 1988-10-10 JR
26 CUMBERLAND 000000061313 BRYANT FREDDIE JR NA NA 7765 NA NA ADRIAN DR NA NA FAYETTEVILLE NC 28314 NA NA M MALE 50 NC 1990-12-26 JR
26 CUMBERLAND 000000055520 COLLINS JACK JR NA NA 5452 NA NA MAC TAYLOR DR NA NA WADE NC 28395 910 3232760 M MALE 50 NC 1994-02-22 JR
26 CUMBERLAND 000000024742 DARRELL JAMES JR NA NA 528 NA NA WARMSPRINGS DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 66 NA 1988-09-01 JR
26 CUMBERLAND 000000015692 DAVIS HENRY JR NA NA 2763 NA NA ELDORADO RD NA NA FAYETTEVILLE NC 28306 NA NA M MALE 52 NC 1988-03-02 JR
7 BEAUFORT 000000000372 GERTZ JR RICHARD NA 164 NA E MAIN ST NA NA WASHINGTON NC 27889 NA NA M MALE 53 OH 1982-03-26 JR
60 MECKLENBURG 000001187500 HOAGLAND JR SANDY NA 821 NA E 8TH ST NA NA CHARLOTTE NC 28202 NA NA M MALE 51 NC 2001-09-28 JR
51 JOHNSTON 000001434080 HOLLEY JR JOHN MARSHAL NA 3441 NA NA STRICKLANDS CROSSROADS NA NA NA FOUR OAKS NC 27524 NA NA M MALE 49 NA 1974-04-04 JR
54 LENOIR 000000059329 JONES JR MICHAEL NA 2990 NA NA LEMUEL DAWSON RD NA NA KINSTON NC 28501 252 5232597 M MALE 42 NY 2001-05-16 JR
42 HALIFAX 000000042048 JOYNER JR EARNEST NA 422 NA NA NC HWY 125 HWY NA NA HOBGOOD NC 27843 252 8265493 M MALE 42 NA 1993-06-11 JR
59 MCDOWELL 000000031434 MCADAMS WILL,JR NA NA 145 NA NA HIGHLAND DR NA NA MARION NC 28752 828 6529266 M MALE 56 NC 1991-10-12 JR
26 CUMBERLAND 000000033082 MCCLELLAND ERNEST JR NA NA 5451 NA NA DODGE CT NA NA FAYETTEVILLE NC 28303 NA NA M MALE 67 NA 1900-01-01 JR
5 ASHE 000000015165 MCCOY JR RICHARD TUNN NA 238 NA S BIG HORSE CREEK RD NA NA LANSING NC 28643 NA NA M MALE 46 NA 1996-02-23 JR
26 CUMBERLAND 000000009593 MCIVER SIM JR NA NA 1632 NA NA SWEETGUM CIR NA NA FAYETTEVILLE NC 28304 NA NA M MALE 62 NA 1900-01-01 JR
26 CUMBERLAND 000000021163 MCLEOD WILLIE JR NA NA 5635 NA NA DODGE DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 70 NA 1900-01-01 JR
26 CUMBERLAND 000000052302 MULL MADISON JR NA NA 618 NA NA MCLAMB DR NA NA FAYETTEVILLE NC 28301 NA NA M MALE 70 NA 1900-01-01 JR
26 CUMBERLAND 000000017799 PALMS DONALD JR NA NA 1715 NA NA STRATFORD RD NA NA FAYETTEVILLE NC 28304 NA NA M MALE 59 NA 1900-01-01 JR
26 CUMBERLAND 000000028221 PEOPLES LONZO JR NA NA 206 NA NA WAXHAW DR NA NA FAYETTEVILLE NC 28314 NA NA M MALE 67 NA 1900-01-01 JR
26 CUMBERLAND 000000056612 ROSADO ALEJANDRO JR NA NA 115 NA NA COCHRAN AVE NA NA FAYETTEVILLE NC 28301 NA NA M MALE 50 NA 1900-01-01 JR
26 CUMBERLAND 000000079877 THOMPSON JOSEPHUS JR NA NA 1442 NA NA CARNSMORE DR NA NA FAYETTEVILLE NC 28304 NA NA M MALE 56 TN 1995-01-06 JR
26 CUMBERLAND 000000009614 TILLMAN BENNIE JR NA NA 367 NA NA TUCSON DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 57 NA 1987-10-14 JR
51 JOHNSTON 000003036808 TOOLE JR NA NA 288 NA NA LITTLE RD NA NA KENLY NC 27542 NA NA M MALE 64 NA 1998-03-05 JR
26 CUMBERLAND 000000034417 WOODS HOUSTON JR NA NA 512 NA NA GLEN CANYON DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 63 NA 1900-01-01 JR
92 WAKE 000031160988 STOCKELL MD COOPER III 2801 NA NA KENTSHIRE PL NA NA APEX NC 27523 NA NA M MALE 50 TN 2002-04-02 MD
19 CHATHAM 000000048284 SPEIGHT MISS STEPHANI RENEE’ NA 365 NA NA SHAD LN NA NA APEX NC 27523 919 8512556 F FEMALE 31 NA 1998-12-07 MISS
92 WAKE 000031261107 FATE MR NA NA 3000 NA NA STONYBROOK DR NA 175 RALEIGH NC 27604 NA NA M MALE 43 PA 2004-09-07 MR
92 WAKE 000031192844 KANE MR NA NA 2648 NA NA SHEFFIELD ST NA NA APEX NC 27539 NA NA M MALE 34 NY 2003-02-20 MR
34 FORSYTH 000000455880 BECK MRS WILLIAM E NA 4113 NA NA HIGH POINT RD NA NA WINSTON SALEM NC 27107 NA NA F FEMALE 79 NC 1956-10-08 MRS
85 STOKES 000000001356 BINGMAN GRAY MRS NA NA 2708 NA NA TUTTLE RD NA NA WALNUT COVE NC 27052 NA 5917886 F FEMALE 68 NC 1969-08-02 MRS
19 CHATHAM 000000002568 BURKE MRS GEORGE W NA 860 NA NA DEAN BEAVERS RD NA NA BEAR CREEK NC 27207 919 8375350 F FEMALE 69 NA 1969-09-13 MRS
85 STOKES 000000003124 CARTER PAUL MRS NA JR 2832 NA NA PINE HALL RD NA NA BELEWS CREEK NC 27009 NA NA F FEMALE 73 NC 1969-08-09 MRS
34 FORSYTH 000001423320 CHATMAN MRS H L NA 634 NA NA HUNTER AVE NA NA WINSTON SALEM NC 27101 NA NA F FEMALE 86 NC 1952-10-25 MRS
77 RICHMOND 000000004857 COVINGTON EDNA(MRS PERRY, JR) NA 804 NA NA ASPEN LN NA NA ROCKINGHAM NC 28379 NA NA F FEMALE 0 CO 1970-04-18 MRS
34 FORSYTH 000001880400 CROMER BETTY MRS A NA 2818 NA NA REYNOLDS PARK RD NA NA WINSTON SALEM NC 27107 NA NA F FEMALE 78 NC 1952-10-25 MRS
89 TYRRELL 000000000329 DAVENPORT MRS H T NA 302 NA N WATER ST NA NA COLUMBIA NC 27925 NA NA F FEMALE 90 NA 1965-10-09 MRS
85 STOKES 000000003485 DODSON RAY MRS NA NA 1902 NA NA NC 65 HWY E NA NA NA WALNUT COVE NC 27052 NA 5917417 F FEMALE 68 NC 1969-08-02 MRS
34 FORSYTH 000002361960 EATON MRS JOHN C NA 1507 NA NA DRUMCLIFFE RD NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 81 SC 1952-10-11 MRS
34 FORSYTH 000002475480 ESTES ALMA MRS A NA 1727 NA NA JONESTOWN RD NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 82 MS 1956-10-22 MRS
19 CHATHAM 000000005898 FIELDS MRS G CLINTON NA 120 NA NA CLINT FIELDS RD NA NA BEAR CREEK NC 27207 919 8372245 F FEMALE 90 NA 1969-09-20 MRS
19 CHATHAM 000000005907 FIELDS MRS JAMES C NA 5348 NA NA RIVES CHAPEL CHURCH RD NA NA SILER CITY NC 27344 919 8372286 F FEMALE 84 NA 1969-10-11 MRS
34 FORSYTH 000002802840 FULP JAMES MRS C NA 3205 NA NA WILLISTON RD NA NA WALKERTOWN NC 27051 NA NA F FEMALE 78 NC 1952-10-18 MRS
85 STOKES 000000005431 GIBSON H MRS L NA 2766 NA NA PINE HALL RD NA NA BELEWS CREEK NC 27009 NA NA F FEMALE 78 NC 1969-08-09 MRS
85 STOKES 000000006681 GOOLSBY EUGENE MRS NA NA 1200 NA NA BAILEYTOWN CH RD NA NA WALNUT COVE NC 27052 NA 5917750 F FEMALE 79 NC 1969-08-02 MRS
77 RICHMOND 000000017627 GURGANIOUS JOHN MRS HALLIE NA 807 NA NA RICE ST NA NA HAMLET NC 28345 NA NA F FEMALE 85 NC 1952-10-18 MRS
77 RICHMOND 000000017660 HAMRICK JOHN R MRS MARGARET NA 908 NA NA HYLAN AVE NA NA HAMLET NC 28345 NA NA F FEMALE 84 CO 1956-10-27 MRS
19 CHATHAM 000000007726 HARRIS MRS FRED W NA 643 NA NA MCLAURIN RD NA NA SILER CITY NC 27344 919 8372272 F FEMALE 75 NA 1969-09-20 MRS
19 CHATHAM 000000007783 HARRIS MRS P D NA 47 NA NA MCLAURIN RD NA NA SILER CITY NC 27344 919 8375378 F FEMALE 104 NA 1969-09-20 MRS
19 CHATHAM 000000007824 HARRIS MRS WILLIAM W NA 9248 NA NA NC HIGHWAY 902 NA NA NA SILER CITY NC 27344 NA NA F FEMALE 61 NA 1969-09-13 MRS
34 FORSYTH 000003458400 HARTIS FRANK E MRS THAMES NA 2421 NA NA FARMBROOK RD NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 77 VA 1954-05-15 MRS
58 MARTIN 000000003480 HOLLIDAY MRS JOSEPH NA NA 2030 NA NA CORINTH CHURCH RD NA NA JAMESVILLE NC 27846 NA NA F FEMALE 104 NA 1956-10-20 MRS
34 FORSYTH 000004234320 JEFFERSON MRS ATHOL G NA 1221 NA NA MARDELE LN NA NA WINSTON SALEM NC 27105 NA NA F FEMALE 74 NC 1952-10-11 MRS
19 CHATHAM 000000009145 JOHNSON MRS CLYDE W NA 308 NA NA ELBERT JOHNSON RD NA NA SILER CITY NC 27344 919 7424640 F FEMALE 74 NA 1969-10-25 MRS
34 FORSYTH 000004735080 LAMB WILSON MRS C NA 142 NA NA SAINT JOHNS CT NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 82 NC 1947-04-12 MRS
85 STOKES 000000008690 LARIMORE WILLIAM MRS NA NA 1035 NA NA COUNTRY TRAIL DR NA NA WALNUT COVE NC 27052 NA 5917863 F FEMALE 63 NC 1976-08-28 MRS
92 WAKE 000031235441 LUU MRS NA NA 8605 NA NA BENZINGER DR NA NA RALEIGH NC 27613 NA NA F FEMALE 54 OC 2004-05-24 MRS
85 STOKES 000000011439 MABE STEVE MRS NA NA 4063 NA NA NC 89 HWY E NA NA NA WALNUT COVE NC 27052 NA 5914972 F FEMALE 62 NC 1969-08-09 MRS
85 STOKES 000000012275 MARTIN JAMES MRS H NA 1112 NA NA MARTIN LUTHER KING JR RD NA NA WALNUT COVE NC 27052 NA 5917978 F FEMALE 70 NC 1969-10-25 MRS
77 RICHMOND 000000018475 MASSAGEE JAMES H MRS SUE NA 103 NA NA ROSEDALE LN NA NA HAMLET NC 28345 NA NA F FEMALE 71 NC 1956-10-20 MRS
19 CHATHAM 000000012417 MOODY MRS WILLARD W NA 716 NA NA JIM MOODY RD NA NA SILER CITY NC 27344 919 7425464 F FEMALE 98 NA 1969-10-25 MRS
80 ROWAN 000000003223 MORGAN MRS ROY A NA 9110 NA NA MOORESVILLE RD NA NA MOUNT ULLA NC 28125 NA NA F FEMALE 82 NA 1968-10-05 MRS
89 TYRRELL 000000001220 NICHOLS DORIS ( MRS W NA NA 507 NA NA US HWY 64 NA E NA COLUMBIA NC 27925 NA NA F FEMALE 92 NA 1965-11-30 MRS
34 FORSYTH 000006590900 POPE MRS O N JR 1870 NA NA POPE LN NA NA KERNERSVILLE NC 27284 NA NA F FEMALE 68 NC 1968-10-04 MRS
34 FORSYTH 000006803040 REICH MRS LESTER G NA 4033 NA NA NORTHAMPTON DR NA NA WINSTON SALEM NC 27105 NA NA F FEMALE 86 NC 1952-05-06 MRS
92 WAKE 000007156880 RHONEY ROBERT MRS T NA 107 NA W AYCOCK ST NA NA RALEIGH NC 27608 NA NA F FEMALE 92 NC 1952-01-01 MRS
19 CHATHAM 000000014905 RIVES MRS WILBUR A NA 8894 NA NA NC HIGHWAY 902 NA NA NA SILER CITY NC 27344 919 8375578 F FEMALE 71 NA 1969-09-20 MRS
34 FORSYTH 000007183560 SCALES BETTY MRS H NA 3040 NA NA SPRINGHAVEN DR NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 69 NC 1960-06-16 MRS
34 FORSYTH 000007686000 SMITH MRS WILLIAM JOE DAVIS NA 5875 NA NA LORADALE DR NA NA KERNERSVILLE NC 27284 NA NA F FEMALE 72 NC 1956-10-16 MRS
34 FORSYTH 000008362080 TIMMONS THOMAS MRS E NA 4540 NA NA INWOOD DR NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 75 NA 1958-05-17 MRS
34 FORSYTH 000008421840 TRULL JAMES MRS T NA 862 NA NA KENWICK DR NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 75 NC 1960-05-12 MRS
34 FORSYTH 000008739960 WARD MARVIN MRS M NA 641 NA NA YORKSHIRE RD NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 89 NC 1947-04-09 MRS
34 FORSYTH 000008982120 WHITE JOE MRS MRS NA 1212 NA NA MAGNOLIA ST NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 86 NC 1958-03-28 MRS
89 TYRRELL 000000001550 WOODLEY MRS WALLACE ( RUTH ) NA 75 NA NA MOORE RD NA NA COLUMBIA NC 27925 NA NA F FEMALE 78 NA 1965-10-09 MRS
50 JACKSON 000000017145 QUEEN GERALDINE(NMN NA NA 757 NA NA CANADA RD NA NA TUCKASEGEE NC 28783 828 2933801 F FEMALE 61 NC 1986-06-24 NMN
23 CLEVELAND 000000038469 BORDERS EUGENE(NMN) NA NA 2009 NA NA STONY POINT RD NA NA SHELBY NC 28150 NA NA M MALE 61 NC 1984-04-01 NMN
1 ALAMANCE 000009070384 FOSTER OTIS(NMN) JR NA 807 NA NA BRANDON MILL CT NA NA ELON NC 27244 336 5860785 M MALE 62 NC 2000-09-08 NMN
11 BUNCOMBE 000000155823 FEATHERSTONE REV. ROBERT A NA 605 NA NA CROWFIELDS LN NA NA ASHEVILLE NC 28803 828 2749494 M MALE 83 NA 1992-08-24 REV
82 SAMPSON 000001075254 GILDEA SISTER THERESINE NA 221 NA NA GRAHAM ST NA NA CLINTON NC 28328 910 5962110 F FEMALE 69 PA 2004-02-04 SISTER
41 GUILFORD 000000115101 KELLY SISTER ANN NA 2206 NA N ELM ST NA NA GREENSBORO NC 27408 NA NA F FEMALE 71 MD 1976-08-27 SISTER
83 SCOTLAND 000000213016 PEGUESE SISTER GIRTRUE NA 609 NA NA ANNE ST NA NA LAURINBURG NC 28352 NA NA F FEMALE 47 NA 1990-04-20 SISTER
55 LINCOLN 000000022736 ROSS SISTER S NA 2297 NA NA WALNUT ST NA NA LINCOLNTON NC 28092 NA NA F FEMALE 79 NC 1968-03-16 SISTER
82 SAMPSON 000001075251 TANCRAITOR SISTER MAXINE ELIZABETH NA 221 NA NA GRAHAM ST NA NA CLINTON NC 28328 910 5962210 F FEMALE 73 PA 2004-02-04 SISTER
26 CUMBERLAND 000000053337 DUNTON JULIAN SR NA NA 3305 NA NA CRANBROOK DR NA NA FAYETTEVILLE NC 28301 NA NA M MALE 0 SC 1978-10-07 SR
26 CUMBERLAND 000000006677 GRAHAM STEPHEN SR LEGREE NA 1827 NA NA RIVERGATE RD NA NA FAYETTEVILLE NC 28304 NA NA M MALE 60 NC 1994-07-19 SR
34 FORSYTH 000010177562 PHILLIPS SR DAYLE KELLEY NA 515 NA NA CLARIDGE CIR NA NA WINSTON SALEM NC 27106 NA NA M MALE 71 PA 1996-10-11 SR
18 CATAWBA 000009092798 ADAMS V JAN NA 6819 NA NA GREEDY HWY NA NA HICKORY NC 28602 NA NA F FEMALE 51 NA 1998-07-15 V
20 CHEROKEE 000000009472 ANDERSON V RUTH K NA 746 NA NA BATES CREEK RD NA NA MURPHY NC 28906 NA NA F FEMALE 66 NC 1969-07-19 V
92 WAKE 000010288652 BATKIN V MARIA NA 1314 NA NA BROOK ARBOR DR NA NA CARY NC 27519 NA NA F FEMALE 52 OC 1995-03-15 V
60 MECKLENBURG 000001330950 BENFIELD RHONDA V NA NA 2518 NA NA TODDVILLE RD NA NA CHARLOTTE NC 28214 NA NA F FEMALE 52 SC 2004-10-08 V
41 GUILFORD 000000019450 BOWDEN V RUTH NA 3243 NA NA PLEASANT RIDGE RD NA NA SUMMERFIELD NC 27358 NA NA F FEMALE 63 NC 1987-10-19 V
57 MADISON 000000001189 BOYD V MARIE NA 1040 NA N MAIN ST NA 203 MARSHALL NC 28753 NA NA F FEMALE 84 NC 1996-10-08 V
100 YANCEY 000000012768 BRANDT V KATHLEEN GRY NA 1872 NA NA MOONSHINE MOUNTAIN RD NA NA BURNSVILLE NC 28714 NA NA F FEMALE 61 PA 1996-04-29 V
23 CLEVELAND 000001000292 CALHOUN V ANNE NA 105 NA NA JOY DR NA NA CHERRYVILLE NC 28021 NA NA F FEMALE 42 CA 2001-10-10 V
88 TRANSYLVANIA 000030007228 CARLAND V ANN NA 128 NA NA COLLEGE WALK LN NA NA BREVARD NC 28712 828 8834555 F FEMALE 79 NC 2004-09-14 V
34 FORSYTH 000001317360 CARTER PAUL V MRS NA 3309 NA NA POLLARD DR NA NA WINSTON SALEM NC 27103 NA NA F FEMALE 76 NC 1952-10-25 V
20 CHEROKEE 000000030170 CAVENDER V DORIS NA 494 NA NA ROCKY CREEK RD NA 3 ANDREWS NC 28901 828 3210711 F FEMALE 72 NC 2004-09-22 V
32 DURHAM 000000671744 COOK INEZ V CARY NA 1829 NA NA FRONT ST NA G8 DURHAM NC 27705 NA NA F FEMALE 90 NY 1996-03-18 V
12 BURKE 000000085485 DALBERG V ANDREA NA 12 NA NA MCKEE DR NA NA MORGANTON NC 28655 NA NA F FEMALE 58 WI 2004-10-05 V
45 HENDERSON 000000050819 DOTY V’ONA GILBERT NA 347 NA NA JACKSON RD NA NA FLETCHER NC 28732 828 6845871 F FEMALE 83 CT 1989-11-14 V
100 YANCEY 000000003159 EDWARDS V ERLINE NA 4940 NA NA US HWY 19 NA NA NA BURNSVILLE NC 28714 NA NA F FEMALE 83 NC 1970-10-23 V
63 MOORE 000000070910 EVANS-SMITH V MARIE HUMPHERY NA 782 NA NA DAYFLOWER CT NA NA WOODLAKE NC 28394 910 2453027 F FEMALE 56 VA 1998-11-24 V
92 WAKE 000010379147 FINLEY V ANNE NA 3701 NA NA SUMMER PL NA NA RALEIGH NC 27604 NA NA F FEMALE 56 NY 1997-05-27 V
71 PENDER 000000001495 FUTRELL V JEANINE BOWDEN NA 1378 NA NA PENDERLEA HWY NA NA BURGAW NC 28425 910 2594012 F FEMALE 38 NC 1988-10-08 V
48 HYDE 000000001770 GIBBS V WILLA NA 12270 NA NA NORTH LAKE RD NA NA ENGELHARD NC 27824 NA NA F FEMALE 91 NC 1968-03-30 V
32 DURHAM 000030074802 GLENN V’SHATAVIA D NA 1801 NA NA FAYETTEVILLE ST NA NA DURHAM NC 27707 NA NA F FEMALE 20 NC 2004-10-05 V
54 LENOIR 000000033745 HALL CATHEDRIA V HOOKER NA 1513 NA NA CHARLOTTE AVE NA NA KINSTON NC 28501 252 9393871 F FEMALE 40 NC 1995-01-10 V
50 JACKSON 000000028608 HALL V JUANITA NA 896 NA NA SPRING FOREST RD NA NA CASHIERS NC 28717 828 7435256 F FEMALE 74 MO 1996-10-11 V
60 MECKLENBURG 000000368267 HAMILTON V KAYE NA NA 7326 NA NA HANNAH ALEXANDER LN NA NA CHARLOTTE NC 28227 NA NA F FEMALE 60 NA 1978-08-01 V
32 DURHAM 000000668400 JAYANTY LAKSHMI S V S NA 19 NA NA FALLING WATER DR NA NA DURHAM NC 27713 NA NA F FEMALE 52 OC 1995-12-14 V
92 WAKE 000010445608 JOHNSON V JOLINE NA 4504 NA NA HAMPTONSHIRE DR NA NA RALEIGH NC 27613 NA NA F FEMALE 69 CO 1998-08-12 V
34 FORSYTH 000030077879 KENNEDY V0NCIEAL LEE NA 3830 NA N MAIN ST NA NA HIGH POINT NC 27265 NA NA F FEMALE 83 CO 2004-10-26 V
34 FORSYTH 000004711560 KRITES V C NA 902 NA W CLEMMONSVILLE RD NA NA WINSTON SALEM NC 27127 NA NA F FEMALE 105 NC 1940-10-26 V
92 WAKE 000031181682 LANCASTER ALDA V LIMBAUGH NA 1711 NA NA KUDROW LN NA NA MORRISVILLE NC 27560 NA NA F FEMALE 81 MI 2002-10-11 V
56 MACON 000000005030 LEE V JUANITA NA 186 NA NA HORIZON HILLS DR NA NA FRANKLIN NC 28734 NA NA F FEMALE 75 NC 1976-10-04 V
42 HALIFAX 000000020666 LEE V FLORENCE NA 1500 NA NA MILLS RD NA NA HOLLISTER NC 27844 NA NA F FEMALE 0 NA 1960-05-14 V
73 PERSON 000000015184 LYONS V BETTIE NA 714 NA NA WOODSDALE RD NA NA ROXBORO NC 27574 NA NA F FEMALE 83 NC 1968-04-20 V
36 GASTON 000002081840 MARSHALL CALLIE V. LUTZ NA 618 NA E OHIO AVE NA NA BESSEMER CITY NC 28016 NA NA F FEMALE 80 NC 1964-09-19 V
49 IREDELL 000005737600 MAYBERRY V JACQUELINE NA 145 NA NA RIDGE VALLEY DR NA NA STATESVILLE NC 28677 NA NA F FEMALE 0 NC 1964-10-10 V
60 MECKLENBURG 000000322013 MOCK V CHARLENE D NA 2401 NA NA WENSLEY DR NA NA CHARLOTTE NC 28210 NA NA F FEMALE 61 NA 1983-11-01 V
60 MECKLENBURG 000001008505 MOORMAN V E NA 1900 NA NA HERRIN AVE NA NA CHARLOTTE NC 28205 NA NA F FEMALE 52 MD 1996-09-15 V
36 GASTON 000007572097 MORTON SANDRA V GOSNELL NA 2709 NA NA LOWELL RD NA NA GASTONIA NC 28054 NA NA F FEMALE 44 NC 1996-10-03 V
45 HENDERSON 000000113610 OSLEY V BONITA NAFZIGER NA 523 NA NA BOONE TOP NA NA NA ZIRCONIA NC 28790 828 6924115 F FEMALE 53 PA 2004-10-08 V
81 RUTHERFORD 000000040280 OWENSBY V ANN NA 631 NA NA POORS FORD RD NA NA RUTHERFORDTON NC 28139 828 2860903 F FEMALE 55 NC 1994-09-06 V
41 GUILFORD 000000164409 PAYNE V LUCILLE NA 1111 NA NA MT HOPE CHURCH RD NA NA MCLEANSVILLE NC 27301 NA NA F FEMALE 79 NC 1956-05-01 V
92 WAKE 000031076032 PERERA V MALLIKA NA 1804 NA NA WALDEN MEADOW DR NA NA APEX NC 27523 NA NA F FEMALE 60 OC 1999-12-07 V
25 CRAVEN 000000012991 POWELL V ESTELLE NA 519 NA NA QUEEN ST NA NA NEW BERN NC 28560 NA 0000000 F FEMALE 85 NC 1968-04-19 V
49 IREDELL 000007533600 RASH V ANDERSON NA 176 NA NA BIGHAM LN NA NA STATESVILLE NC 28677 NA NA F FEMALE 86 NC 1964-10-10 V
34 FORSYTH 000006738480 RAY V FRANCIS NA 3081 NA NA KINNAMON RD NA NA WINSTON SALEM NC 27104 NA NA F FEMALE 105 NC 1952-10-11 V
32 DURHAM 000000560485 SEMONCHE LAURA V A NA 2650 NA NA UNIVERSITY DR NA NA DURHAM NC 27707 NA NA F FEMALE 40 NC 1984-04-09 V
45 HENDERSON 000000014832 SHAFFER V LYNNE STRICKLAND NA 1008 NA NA MOUNTAIN VIEW ST NA NA HENDERSONVILLE NC 28739 828 6983878 F FEMALE 46 NC 1980-08-21 V
80 ROWAN 000000019026 SHELF V S MRS NA 809 NA NA BOSTIAN AVE NA NA KANNAPOLIS NC 28083 NA NA F FEMALE 92 NA 1968-04-06 V
11 BUNCOMBE 000000005285 SMELTZER V DIANE NA 449 NA NA VANDERBILT RD NA NA ASHEVILLE NC 28803 828 2747708 F FEMALE 55 NA 1980-09-11 V
16 CARTERET 000000001547 SMITH V RAE NA 405 NA NA BARBOUR RD NA 194 MOREHEAD CITY NC 28557 252 7260743 F FEMALE 81 NC 1979-03-20 V
63 MOORE 000000055727 STANTON V GAYLE NA 256 NA NA CLIFFMAN RD NA NA PINEBLUFF NC 28373 910 2815400 F FEMALE 61 FL 1994-10-07 V
92 WAKE 000010401214 STERLING V LEE NA 1604 NA NA LAURA DUNCAN RD NA NA APEX NC 27502 NA NA F FEMALE 41 IL 1997-10-10 V
64 NASH 000000076835 STODDARD V CHRISTIVE NA 512 NA NA COLONY SQ NA NA ROCKY MOUNT NC 27804 252 8241618 F FEMALE 61 MD 2004-06-09 V
60 MECKLENBURG 000001177848 STREIFF CONNIE V R NA 15213 NA NA MCKNITT LN NA NA CHARLOTTE NC 28277 NA NA F FEMALE 54 NJ 2001-05-14 V
18 CATAWBA 000009034909 TEAGUE V MICHELLE NA 3331 NA NA 3RD ST DR NW NA HICKORY NC 28601 NA NA F FEMALE 35 NA 1988-02-05 V
62 MONTGOMERY 000000004983 TERRY CAROLYN V MASK NA 101 NA W SCARBORO ST NA NA MT GILEAD NC 27306 NA NA F FEMALE 58 NC 1970-04-04 V
41 GUILFORD 000009705376 THOMPSON V DELORES NA 1506 NA NA CHATHAM DR NA NA HIGH POINT NC 27265 NA NA F FEMALE 50 SC 1997-07-02 V
65 NEW HANOVER 000000043039 TINNEY V LEE W NA 515 NA NA MCEACHERN CT NA NA WILMINGTON NC 28412 NA NA F FEMALE 50 NC 1978-12-14 V
6 AVERY 000000012794 VANNOY V GAIL NA 616 NA NA CRANBERRY ST NA NA NEWLAND NC 28657 NA NA F FEMALE 60 NC 1995-10-06 V
60 MECKLENBURG 000000889414 WAGGONER V C NA 3201 NA NA MOORES GLEN DR NA NA CHARLOTTE NC 28209 NA NA F FEMALE 59 MD 1992-09-30 V
41 GUILFORD 000000221856 WALKER V FRANCES NA 523 NA NA AUDUBON DR NA NA GREENSBORO NC 27410 NA NA F FEMALE 69 NC 1972-10-07 V
60 MECKLENBURG 000000326358 WHITE V CAROLE NA NA 21516 NA NA DELFTMERE DR NA NA CORNELIUS NC 28031 NA NA F FEMALE 67 NA 1960-10-01 V
39 GRANVILLE 000000008868 WILLIAMS JACQUELYNE V. MOORE NA 609 NA NA 27TH ST NA NA BUTNER NC 27509 NA NA F FEMALE 44 NC 1980-09-29 V
23 CLEVELAND 000000034666 WRIGHT O V LEDFORD NA 1719 NA N POST RD NA NA SHELBY NC 28150 NA NA F FEMALE 86 NC 1968-10-26 V
78 ROBESON 000000442515 ADAMS A V NA NA 2322 NA NA MEADOW RD NA NA LUMBERTON NC 28358 NA NA M MALE 58 NA 1999-01-07 V
92 WAKE 000000002437 ADAMS V WAYNE NA 6817 NA NA VALLEY DR NA NA RALEIGH NC 27612 NA NA M MALE 72 IL 1979-01-01 V
51 JOHNSTON 000000106500 ALLEN V B NA 500 NA NA IVEY RD NA NA FOUR OAKS NC 27524 919 8943302 M MALE 97 NA 1964-05-16 V
41 GUILFORD 000000007239 AVVA V SARMA NA 4109 NA NA LAWNDALE PL NA NA GREENSBORO NC 27455 NA NA M MALE 73 NA 1981-10-02 V
18 CATAWBA 000009024237 BARBOUR V KEITH NA 3225 NA NA 27TH ST NE NA HICKORY NC 28601 NA NA M MALE 45 NA 1984-08-31 V
66 NORTHAMPTON 000000011040 BAZEMORE V S NA 171 NA NA LASSITER MILL RD NA NA WOODLAND NC 27897 NA NA M MALE 85 NC 1952-05-17 V
76 RANDOLPH 000000003145 BOWMAN V C NA 2262 NA NA RAMSEUR JULIAN RD NA NA LIBERTY NC 27298 NA NA M MALE 69 FL 1960-05-14 V
92 WAKE 000009800407 BOYKIN V RAYMOND JR 1500 NA NA CLARK FARM RD NA NA APEX NC 27502 NA NA M MALE 60 NC 1977-01-01 V
81 RUTHERFORD 000000016926 CLINE V OTHO JR 201 NA NA WESTBROOK DR NA NA RUTHERFORDTON NC 28139 828 2869802 M MALE 74 NC 1969-08-09 V
26 CUMBERLAND 000000058318 CORRELL V C NA 6617 NA NA IDLEWOOD CT NA NA FAYETTEVILLE NC 28314 NA NA M MALE 75 NC 1980-01-28 V
34 FORSYTH 000030076409 DEAL R V ROB NA 1900 NA W FIRST ST NA NA WINSTON SALEM NC 27104 NA NA M MALE 75 NC 2004-10-08 V
85 STOKES 000000033405 DEHART V L JR 1010 NA NA DEHART RD NA NA WALNUT COVE NC 27052 NA NA M MALE 49 NC 1993-10-11 V
19 CHATHAM 000000004985 DREYER V DEAN NA 35 NA NA COTTEN ST NA NA PITTSBORO NC 27312 919 5425239 M MALE 90 NA 1983-09-27 V
45 HENDERSON 000000082489 GORDON V H NA 153 NA NA GORDON VIEW DR NA NA ZIRCONIA NC 28790 NA NA M MALE 57 NC 1998-08-11 V
60 MECKLENBURG 000000463605 HELTON V JOHNNY NA NA 6801 NA NA PINE LAKE LN NA NA CHARLOTTE NC 28227 NA NA M MALE 59 NA 1980-11-01 V
34 FORSYTH 000003728040 HICKS V L NA 4758 NA NA CARLTON DR NA NA WINSTON SALEM NC 27105 NA NA M MALE 76 NC 1952-05-10 V
56 MACON 000000007439 HOLLAND V L NA 664 NA NA MASHBURN BRANCH RD NA NA FRANKLIN NC 28734 NA NA M MALE 81 NC 1973-10-05 V
26 CUMBERLAND 000000020674 HOLLINSHED V E JR 2533 NA NA MIRROR LAKE DR NA NA FAYETTEVILLE NC 28303 NA NA M MALE 0 NA 1900-01-01 V
11 BUNCOMBE 000000170334 HONEYCUTT V J NA 243 NA NA YOUNGS COVE RD NA NA CANDLER NC 28715 NA NA M MALE 60 NA 1994-09-22 V
92 WAKE 000010279488 HOUSEHOLDER V R NA 7020 NA NA PENNY RD NA NA RALEIGH NC 27606 NA NA M MALE 60 WV 1995-02-10 V
34 FORSYTH 000004120320 IDOL V F NA 785 NA NA TWIN CREEK RD NA NA KERNERSVILLE NC 27284 NA NA M MALE 105 NC 1944-10-14 V
92 WAKE 000004184540 IRAGGI V J NA 431 NA NA FENTON ST NA NA RALEIGH NC 27604 NA NA M MALE 84 NY 1952-01-01 V
32 DURHAM 000000680083 IYER V V NA NA 201 NA NA IVY MEADOW LN NA NA DURHAM NC 27707 NA NA M MALE 30 OH 1996-09-05 V
92 WAKE 000004222220 JACKSON V L NA 5609 NA NA DEBLYN AVE NA NA RALEIGH NC 27612 NA NA M MALE 80 NC 1971-01-01 V
92 WAKE 000031120908 JEFFRIES V’GER S NA 218 NA NA FOX RUN DR NA NA WENDELL NC 27591 NA NA M MALE 25 NC 2000-10-13 V
92 WAKE 000004566430 JONES V W NA 4013 NA NA CASTLE CT NA NA RALEIGH NC 27613 NA NA M MALE 65 NC 1974-01-01 V
10 BRUNSWICK 000000065743 KRASNIEWICZ V A NA 400 NA S MIDDLETON DR NW NA CALABASH NC 28467 910 2872013 M MALE 67 NY 1999-02-02 V
34 FORSYTH 000004711440 KRITES V C NA 902 NA W CLEMMONSVILLE RD NA NA WINSTON SALEM NC 27127 NA NA M MALE 105 NC 1940-10-26 V
19 CHATHAM 000000042414 KRYSTOFIAK V L NA 32 NA NA MCDOWELL NA NA NA PITTSBORO NC 27312 919 5450928 M MALE 80 NA 1996-06-21 V
34 FORSYTH 000004925520 LEWIS V M NA 112 NA NA NELSON ST NA NA KERNERSVILLE NC 27284 NA NA M MALE 105 NC 1944-10-23 V
60 MECKLENBURG 000000457797 LIND V WILLIAM NA JR 2006 NA NA STEDWICK PL NA NA CHARLOTTE NC 28211 NA NA M MALE 60 NA 1982-09-01 V
84 STANLY 000000023481 LOCKAMY V B NA 810 NA N 5TH ST NA NA ALBEMARLE NC 28001 NA NA M MALE 87 NA 1962-05-05 V
60 MECKLENBURG 000000601393 LOMBARDI V ALAN NA NA 3308 NA NA FRENCH WOODS RD NA NA CHARLOTTE NC 28269 NA NA M MALE 51 NA 1988-05-01 V
60 MECKLENBURG 000001141172 MANGIPUDI V RAO NA NA 16401 NA NA BRIDGEHAMPTON CLUB DR NA NA CHARLOTTE NC 28277 NA NA M MALE 65 NA 2000-09-01 V
64 NASH 000000079919 MANN R V NA NA 12072 NA NA NC 48 NA NA NA WHITAKERS NC 27891 252 4371933 M MALE 59 NC 2004-09-04 V
34 FORSYTH 000005301370 MARTIN V GRAY JR 1601 NA NA CHICKASHA DR NA NA PFAFFTOWN NC 27040 NA 9455756 M MALE 54 NC 1971-11-04 V
92 WAKE 000005429160 MATHENY V O JR 3712 NA NA ESSIE ST NA NA RALEIGH NC 27610 NA NA M MALE 71 NC 1956-01-01 V
80 ROWAN 000000022939 MCKINNEY V A NA 4811 NA NA MOORESVILLE RD NA NA SALISBURY NC 28147 NA NA M MALE 80 NA 1968-04-20 V
58 MARTIN 000000003834 MODLIN V WAYNE NA 1050 NA NA CLAUDE MOBLEY LN NA NA JAMESVILLE NC 27846 NA NA M MALE 53 NC 1972-10-04 V
60 MECKLENBURG 000000926932 NORMAN V WAYNE NA NA 2769 NA NA VON THURINGER CT NA NA CHARLOTTE NC 28210 NA NA M MALE 62 NC 1994-10-17 V
60 MECKLENBURG 000001168573 OAKLEY V BRADSHER III 11548 NA NA SAVANNAH CREEK DR NA NA CHARLOTTE NC 28273 NA NA M MALE 33 VA 2000-10-15 V
60 MECKLENBURG 000000993576 OATES A V NA NA 3817 NA NA VINING CT NA NA CHARLOTTE NC 28216 NA NA M MALE 70 SC 1996-06-26 V
92 WAKE 000010055261 OGLESBY V BOYCE JR 315 NA NA CHANTICLAIR DR NA NA APEX NC 27502 NA NA M MALE 43 VA 1986-01-01 V
60 MECKLENBURG 000000408435 PFAHL V KEVIN NA NA 1900 NA NA VERNON DR NA NA CHARLOTTE NC 28211 NA NA M MALE 50 NA 1980-04-01 V
60 MECKLENBURG 000000436494 PIERANNUNZI V PAUL NA NA 8411 NA NA SILVER MAPLE LN NA NA CHARLOTTE NC 28227 NA NA M MALE 60 NA 1980-09-01 V
92 WAKE 000009802139 PLAYER V STEPHEN NA 8113 NA NA GLENBRITTLE WAY NA NA RALEIGH NC 27615 NA NA M MALE 58 FL 1977-01-01 V
23 CLEVELAND 000000025575 POWELL V A JR 2033 NA W STAGE COACH TRL NA NA SHELBY NC 28150 NA NA M MALE 75 NC 1964-04-25 V
11 BUNCOMBE 000000275005 RASH A V NA NA 10 NA NA DENVER DR NA NA WEAVERVILLE NC 28787 NA NA M MALE 70 NC 2004-10-06 V
49 IREDELL 000007628100 REDMOND V PRESTON JR 144 NA NA DEER TRAIL RD NA NA STATESVILLE NC 28677 NA NA M MALE 57 NC 1970-10-01 V
78 ROBESON 000000291458 REVELS V D NA NA 245 NA NA APOLLO LN NA NA ROWLAND NC 28383 NA NA M MALE 60 NA 1975-04-09 V
99 YADKIN 000000011317 REYNOLDS V FRANK NA 2121 NA NA SHACKTOWN RD NA NA YADKINVILLE NC 27055 336 9612545 M MALE 89 NC 1966-05-11 V
76 RANDOLPH 000000032302 RUMLEY V CLIFTON NA 535 NA NA NC HWY 42 NA S NA ASHEBORO NC 27205 NA NA M MALE 85 NC 1954-05-15 V
60 MECKLENBURG 000000941506 SCALDARA A V NA NA 7210 NA NA BROADFORD CT NA NA CHARLOTTE NC 28277 NA NA M MALE 62 MD 1995-06-13 V
41 GUILFORD 000000191608 SHIELDS V E NA 8515 NA NA CIDER RD NA NA COLFAX NC 27235 NA NA M MALE 88 NC 1952-05-17 V
8 BERTIE 000000010892 SLADE V T NA 702 NA NA PERRYTOWN RD NA NA COLERAIN NC 27924 NA NA M MALE 90 NC 1965-10-09 V
39 GRANVILLE 000000010610 TEMPLE V W NA 1600 NA NA NC HWY 50 NA NA NA CREEDMOOR NC 27522 NA NA M MALE 87 NC 1946-05-11 V
68 ORANGE 000000117530 WARD V STUART JR 113 NA NA MOSSBARK LN NA NA CHAPEL HILL NC 27514 NA NA M MALE 80 VA 1992-09-30 V
23 CLEVELAND 000000042337 WHITE A V NA JR 126 NA NA MORIAH SCHOOL RD NA NA CASAR NC 28020 NA NA M MALE 57 NC 1984-09-26 V
64 NASH 000000053532 WHITSON V L NA 9678 NA NA OLD BAILEY HWY NA NA WILSON NC 27896 NA NA M MALE 70 NA 2000-07-26 V
99 YADKIN 000000015645 WOOTEN V ALDENE NA 4037 NA NA FORBUSH RD NA NA EAST BEND NC 27018 336 9614051 M MALE 76 NC 1966-05-14 V
92 WAKE 000009547600 WYATT V CHARLES NA 2206 NA NA BEECHRIDGE RD NA NA RALEIGH NC 27608 NA NA M MALE 57 NC 1972-01-01 V
41 GUILFORD 000000005042 ANTHONY VI JOHNSON NA 6100 NA W FRIENDLY AVE NA 1312 GREENSBORO NC 27410 NA NA F FEMALE 82 GA 1987-08-27 VI
32 DURHAM 000030060851 DO VI THUY NA 5303 NA NA ROSE GARDEN LN NA NA DURHAM NC 27707 NA NA F FEMALE 25 KS 2004-08-11 VI
23 CLEVELAND 000000012622 GREENE VI HEGE NA 2072 NA NA FALLSTON RD NA NA SHELBY NC 28150 NA NA F FEMALE 48 NC 1982-05-21 VI
92 WAKE 000031081313 HUTCHINSON VI THI NA 521 NA NA SNOWY EGRET TRL NA NA CARY NC 27511 NA NA F FEMALE 28 OC 2000-02-21 VI
41 GUILFORD 000000512394 LAI VI LE NA 3601 NA NA FLETCHER DR NA NA GREENSBORO NC 27406 NA NA F FEMALE 21 NA 2002-10-11 VI
26 CUMBERLAND 000000181026 NGUYEN VI THOAI NA 1104 NA NA PADDINGTON PL NA NA FAYETTEVILLE NC 28314 910 8683998 F FEMALE 25 VI 1999-03-25 VI
60 MECKLENBURG 000001297281 NGUYEN VI TUONG NA 2208 NA NA O’HARA DR NA NA CHARLOTTE NC 28273 NA NA F FEMALE 20 NA 2004-09-03 VI
78 ROBESON 000000112822 TOWNSEND VI S NA 210 NA NA GLASGOW DR NA NA LUMBERTON NC 28358 NA NA F FEMALE 48 NA 1976-10-04 VI
26 CUMBERLAND 000065994989 VO VI PHUONG NA 408 NA NA HADLEY CT NA NA FAYETTEVILLE NC 28301 910 4380800 F FEMALE 28 OC 2002-04-17 VI
95 WATAUGA 000000062620 GALLOWAY VI CKY RONALD NA 566 NA NA FOX DEN RD NA NA BLOWING ROCK NC 28605 828 2653093 M MALE 58 SC 2001-05-02 VI
41 GUILFORD 000009906859 THAI VI KY NA 1827 NA NA BEARHOLLOW RD NA NA GREENSBORO NC 27410 NA NA M MALE 59 NA 1999-07-12 VI
60 MECKLENBURG 000001110356 TRAN VI TAN NA 8617 NA NA HAMBRIGHT RD NA NA HUNTERSVILLE NC 28078 NA NA M MALE 57 NA 1999-09-28 VI

I inspected the results by eye and dropped from the search list those words that mostly appeared to be valid name components.

Words that appear to be invalid in the first name, by position:

  • As whole field: FATHER, III, IV, JR, MD, MR, MRS, SISTER, SR
  • As first word: DR, MISS, MRS, REV, SISTER
  • As last word: III, JR, MRS, NMN, SR
  • As internal word: MRS
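
The four positions used in the list above can be made concrete with a small sketch. This is not part of the original analysis: `word_position()` is a hypothetical helper, written in base R so that it runs without any packages, and it assumes the same normalisation as the matching step (upper-case, non-letters replaced by spaces, repeated spaces collapsed).

```r
# Hypothetical helper (not from the original analysis):
# report where `word` occurs within each name field.
word_position <- function(field, word) {
  # Normalise: upper-case, non-letters to spaces, collapse repeated spaces
  clean <- toupper(field)
  clean <- gsub("[^ A-Z]", " ", clean)
  clean <- trimws(gsub(" +", " ", clean))
  words <- strsplit(clean, " ", fixed = TRUE)

  vapply(words, function(w) {
    hit <- which(w == word)
    if (length(hit) == 0) return(NA_character_)   # word not present
    if (length(w) == 1) return("whole field")
    if (hit[1] == 1) return("first word")
    if (hit[1] == length(w)) return("last word")
    "internal word"
  }, character(1))
}

positions <- word_position(
  c("MRS", "MRS ANNE", "PAUL MRS", "ANN MRS LEE", "ANNE"),
  "MRS"
)
positions
```

Only the first occurrence of the word in each field is classified, which is enough for tabulating positions by eye.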

5.4.3 Middle name

# regular expression matching any of the remaining special words as a whole word
w_regexp <- 
  c(w_hons, w_gen, w_spec, w_test) %>% # all special words
  unique() %>% # make it a set
  dplyr::setdiff( # remove words that appear to mostly be validly used
    c(
      "BISHOP",
      "BLIND",
      "BR",
      "BROTHER",
      "DOCTOR",
      "ELDER",
      "FIRST",
      "JR", # invalid & too many to display 
      "JUNIOR",
      "MASTER",
      "MISTER",
      "MRS", # invalid & too many to display
      "NMN", # invalid & too many to display
      "PASTOR",
      "SENIOR",
      "SISTER",
      "I",
      "V",
      "VI",
      "VOTER"
    )
  ) %>% 
  glue::glue(x = . , "\\b{x}\\b") %>%  # wrap each word in word boundaries so only whole words match
  glue::glue_collapse(sep = "|") # join with alternation so any one of the words matches

x <- d %>% 
  dplyr::mutate(
    match = 
      midl_name %>% 
      stringr::str_to_upper() %>% 
      stringr::str_replace_all(pattern = "[^ A-Z]", replacement = " ") %>% 
      stringr::str_squish() %>% 
      stringr::str_extract(pattern = w_regexp)
  ) %>% 
  dplyr::filter(!is.na(match))

nrow(x)
[1] 98
x %>% 
  dplyr::arrange(match, sex, last_name, first_name) %>% 
  knitr::kable()
county_id county_desc voter_reg_num last_name first_name midl_name name_sufx_cd house_num half_code street_dir street_name street_type_cd street_sufx_cd unit_num res_city_desc state_cd zip_code area_cd phone_num sex_code sex age birth_place registr_dt match
28 DARE 000000047080 WISE DIANA AKA NA 279 NA NA OUTRIGGER DR NA NA KILL DEVIL HILLS NC 27948 NA NA F FEMALE 57 NA 1987-04-30 AKA
92 WAKE 000009930774 CACCAMO KATHLEEN DR NA 3405 NA NA BRENNAN DR NA NA RALEIGH NC 27613 NA NA F FEMALE 54 VA 1984-01-01 DR
60 MECKLENBURG 000001027324 DUNCAN ROSALYN DR NA 5516 NA NA TORRENCE ST NA NA CHARLOTTE NC 28269 704 5972761 F FEMALE 47 NC 1996-11-05 DR
60 MECKLENBURG 000001005420 GEORGE AMAY DR NA 5418 NA NA RUPERT LN NA NA CHARLOTTE NC 28215 704 5378422 F FEMALE 27 NC 1996-09-22 DR
19 CHATHAM 000000017933 VANN ELLEN DR NA 801 NA NA CLIFTWOOD DR NA NA SILER CITY NC 27344 919 6633939 F FEMALE 67 NA 1972-10-07 DR
34 FORSYTH 000002417640 ELESHA WILLIAM DR NA 3134 NA NA WATERFORD RD NA NA WINSTON SALEM NC 27106 NA NA M MALE 85 OC 1959-10-09 DR
60 MECKLENBURG 000000200565 ROBICSEK FRANCIS DR NA 2519 NA NA RICHARDSON DR NA NA CHARLOTTE NC 28211 NA NA M MALE 80 NA 1972-10-01 DR
51 JOHNSTON 000002764400 ROPER THOMAS E DR NA 1110 NA NA CHESTNUT DR NA NA SMITHFIELD NC 27577 919 9348651 M MALE 72 NA 1964-05-15 DR
77 RICHMOND 000000008054 VETTER JOHN S DR NA 1424 NA NA CUMBERLAND CIR NA NA ROCKINGHAM NC 28379 NA NA M MALE 77 TN 1958-05-17 DR
87 SWAIN 000000000455 BIRCHFIELD HARRY LYNN II NA 1234 NA NA JENKINS BRANCH RD NA NA BRYSON CITY NC 28713 NA NA M MALE 37 NC 1986-03-20 II
44 HAYWOOD 000000026398 DINGMAN LEONARD ALAN II NA 3523 NA NA DELLWOOD RD NA NA WAYNESVILLE NC 28786 828 9260368 M MALE 36 NC 1987-06-01 II
44 HAYWOOD 000000027292 FRADY ROBERT GLENN II NA 87 NA NA MAUNEY COVE RD NA NA WAYNESVILLE NC 28786 828 4522654 M MALE 39 NC 1988-03-04 II
98 WILSON 000000010981 GLOVER CHARLES WORTH II NA 4362 NA NA NC HWY 42 NA E A WILSON NC 27893 NA NA M MALE 49 NC 1976-02-13 II
19 CHATHAM 000000035226 HAWKINS ROGER LARRY II NA 1525 NA NA FARRINGTON RD NA NA APEX NC 27523 NA NA M MALE 35 NA 1994-08-08 II
42 HALIFAX 000000039335 HUNTER ERNEST II NA 194 NA NA CEDAR LN NA NA HALIFAX NC 27839 NA NA M MALE 33 NA 1991-07-23 II
98 WILSON 000000015334 KELLY DAVID LEE II NA 5041 NA NA CONIFER CT NA NA WILSON NC 27896 NA NA M MALE 48 WV 1984-10-05 II
42 HALIFAX 000000011122 KERR JAMES II NA 108 NA S RIVER RD NA NA ROANOKE RAPIDS NC 27870 NA NA M MALE 45 NA 1982-10-01 II
13 CABARRUS 000000060822 KUHNE KURT II NA 1502 NA NA 12TH FAIRWAY DR NW NA CONCORD NC 28027 NA NA M MALE 62 NA 1988-02-08 II
60 MECKLENBURG 000001267261 ROGERS SYLVESTER II SR 7138 NA NA RENA MAE LN NA NA CHARLOTTE NC 28227 704 5734924 M MALE 73 NA 2004-03-23 II
98 WILSON 000000023732 SHERWOOD GEORGE ROYALL II NA 907 NA NA WALNUT ST W NA WILSON NC 27893 NA NA M MALE 60 NC 1968-05-20 II
98 WILSON 000000048227 SOGLUIZZO JOSEPH JOHN II NA 3008 NA NA BRENTWOOD DR N NA WILSON NC 27896 252 2436056 M MALE 34 NC 1994-06-16 II
20 CHEROKEE 000000011078 VAN GORDER CHARLES OSCAR II NA 2 NA NA UNKNOWN NA NA NA ANDREWS NC 28901 NA NA M MALE 91 TN 1969-07-19 II
98 WILSON 000000047845 WALSTON CHARLES EDWARD II NA 4804 NA NA HATHAWAY RD NA NA WILSON NC 27893 252 2437963 M MALE 31 NC 1994-01-03 II
34 FORSYTH 000030011868 WATKINS MONROE II NA 1147 NA NA BANK ST NA NA WINSTON SALEM NC 27101 NA NA M MALE 30 NC 2000-04-07 II
44 HAYWOOD 000000030385 YOUNGMAN THOMAS ARDEN II NA 387 NA NA SOCO RD NA NA MAGGIE VALLEY NC 28751 828 9262801 M MALE 35 NC 1990-03-14 II
60 MECKLENBURG 000001334755 BROWN HARRY III NA 6000 NA NA FAIRVIEW RD NA 1200 CHARLOTTE NC 28210 NA NA M MALE 42 SC 2004-10-08 III
42 HALIFAX 000000053858 BROWN MILES III NA 60 NA NA FOXTAIL LN NA NA LITTLETON NC 27850 252 5864436 M MALE 56 NA 1998-04-03 III
25 CRAVEN 000000093264 COOPER DALTON III NA 819 NA NA QUEEN ST NA NA NEW BERN NC 28560 252 6362507 M MALE 23 NC 2004-10-01 III
42 HALIFAX 000000054806 DAILEY LANGRA III NA 123 NA NA OAKRIDGE DR NA NA ROANOKE RAPIDS NC 27870 NA NA M MALE 25 NA 1998-11-02 III
41 GUILFORD 000000268329 FUNDERBURK TRAVIS III NA 4102 NA NA BIRCHGARDEN DR NA NA HIGH POINT NC 27265 NA NA M MALE 32 NC 1992-11-30 III
1 ALAMANCE 000009056466 GADISON NATHANIEL III NA 1718 NA N COSBY CT NA NA BURLINGTON NC 27217 336 2276119 M MALE 63 NC 1997-04-09 III
66 NORTHAMPTON 000000011718 GAY ROBERT HENRY, III. NA 312 NA S MAIN ST NA NA SEABOARD NC 27876 252 5891914 M MALE 32 NA 1998-11-25 III
42 HALIFAX 000000054971 GEE LAWRENCE III NA 1050 NA NA REESES RD NA NA HALIFAX NC 27839 252 5363160 M MALE 53 NA 1998-11-19 III
42 HALIFAX 000000050502 HARPER GUS III NA 976 NA NA QUALLS FARM RD NA NA ENFIELD NC 27823 252 4455899 M MALE 37 NA 1996-10-07 III
1 ALAMANCE 000003089200 HOLT ISAAC III NA 2369 NA S NC HWY 87 NA NA NA GRAHAM NC 27253 NA 2267444 M MALE 52 NC 1972-10-07 III
69 PAMLICO 000000005041 HUMPHREY ROLAND M III NA 5395 NA NA KERSHAW RD NA NA ORIENTAL NC 28571 NA NA M MALE 39 NC 1984-05-02 III
42 HALIFAX 000000051692 JOHNSON SHADE III NA 266 NA NA CEDAR LN NA NA HALIFAX NC 27839 252 5865191 M MALE 32 NA 1997-05-06 III
42 HALIFAX 000000042552 JOYNER DOUGLAS III NA 46 NA NA LINCOLN ST NA NA ROANOKE RAPIDS NC 27870 NA NA M MALE 30 NA 1994-03-24 III
42 HALIFAX 000000040051 LYNCH ABRAHAM III NA 212 NA NA YOUNG RD NA NA ENFIELD NC 27823 NA NA M MALE 31 NA 1992-04-02 III
1 ALAMANCE 000004437150 MCGILVERY ROBERT III NA 1755 NA NA PAYNE RD NA NA GRAHAM NC 27253 NA 0000000 M MALE 46 NJ 1978-09-25 III
60 MECKLENBURG 000001005992 MCILWAIN FERRY III NA 7220 NA NA MEADOWLAND DR NA NA CHARLOTTE NC 28215 NA NA M MALE 41 NC 1996-09-12 III
78 ROBESON 000000181261 PHILLIPS ALEXANDER ROW III NA 308 NA W 18TH ST NA NA LUMBERTON NC 28358 NA NA M MALE 40 NA 1992-07-24 III
42 HALIFAX 000000043187 PRICE PAUL III III 8 NA NA FRANKLIN ST NA NA ROANOKE RAPIDS NC 27870 252 4100472 M MALE 38 NC 1994-12-16 III
34 FORSYTH 000030046228 STEELE HARVEY III NA 3340 NA NA WOLFE STORE LN NA NA PFAFFTOWN NC 27040 NA NA M MALE 34 NY 2003-02-10 III
91 VANCE 000000102353 TERRY GEORGE III NA 302 NA NA BODDIE ST NA NA HENDERSON NC 27536 252 4322001 M MALE 21 NC 2004-08-26 III
60 MECKLENBURG 000001305328 THOMAS PAUL III NA 5025 NA NA OSAGE CIR NA NA CHARLOTTE NC 28269 704 5982270 M MALE 45 NC 2004-09-16 III
92 WAKE 000031137907 BAKER LOUIS IV NA 1110 NA NA PARK AVE NA NA GARNER NC 27529 NA NA M MALE 26 FL 2001-05-21 IV
59 MCDOWELL 000000003387 CROSS EUGENE IV NA 594 NA NA HIDDEN VIEW LOOP NA NA MARION NC 28752 NA NA M MALE 46 NC 1980-03-14 IV
18 CATAWBA 000030036698 ESPOSITO VINCENT JOHN IV NA 5319 NA NA LISA LN NA NA HICKORY NC 28602 828 2942322 M MALE 33 CT 2005-06-23 IV
6 AVERY 000000008936 GUNNOE ROBERT FELIX IV NA 0 NA NA NC HWY 194 NA NA NA ELK PARK NC 28622 NA NA M MALE 40 NA 1986-03-17 IV
6 AVERY 000000011732 HORNEY HARRISON MARTIN IV NA 0 NA NA HORNEY RD NA NA NEWLAND NC 28657 NA NA M MALE 30 NC 1994-02-18 IV
100 YANCEY 000000013109 HUMBERT JOHN LAWRENCE IV NA 5786 NA NA STATE HIGHWAY 80 S NA NA NA BURNSVILLE NC 28714 NA NA M MALE 27 FL 1998-09-30 IV
81 RUTHERFORD 000030005769 BRONSON JENNIFER MD NA 4751 NA NA US HWY 221A NA NA NA MOORESBORO NC 28114 828 6576941 F FEMALE 22 CA 2002-02-08 MD
12 BURKE 000000030802 MCGIMSEY JAMES F JR MD NA 100 NA NA VALDESE AVE NA NA MORGANTON NC 28655 NA NA M MALE 87 NA 1968-03-30 MD
34 FORSYTH 000000717120 BOLES FAUSTINE MISS NA 4234 NA NA REIDSVILLE RD NA NA WINSTON SALEM NC 27101 NA NA F FEMALE 105 NC 1940-05-06 MISS
68 ORANGE 000000026794 BREEZE ALMA EARL MISS NA 300 NA NA MEADOWLANDS DR NA 112B HILLSBOROUGH NC 27278 NA NA F FEMALE 90 NC 1966-10-08 MISS
34 FORSYTH 000002044920 DAVIS JULIA MISS NA 2326 NA NA CLOVERDALE AVE NA D WINSTON SALEM NC 27103 NA NA F FEMALE 105 NC 1947-10-24 MISS
34 FORSYTH 000002867280 GARBER CORNELIA MISS NA 419 NA N STRATFORD RD NA NA WINSTON SALEM NC 27104 NA NA F FEMALE 105 NY 1947-04-09 MISS
34 FORSYTH 000003295920 HAM MABLE MISS NA 443 NA NA HASTINGS HILL RD NA NA KERNERSVILLE NC 27284 NA NA F FEMALE 77 NC 1952-10-25 MISS
34 FORSYTH 000005537890 MCKOY CAROL MISS NA 3017 NA NA CARR CT NA NA WINSTON SALEM NC 27105 NA NA F FEMALE 105 SC 1968-09-30 MISS
34 FORSYTH 000005885040 MORRISON LULA MISS NA 2315 NA NA MANCHESTER ST NA NA WINSTON SALEM NC 27105 NA NA F FEMALE 105 NC 1947-04-12 MISS
34 FORSYTH 000005906500 MOSER ROSE MISS NA 2418 NA NA PATRIA ST NA NA WINSTON SALEM NC 27127 NA NA F FEMALE 105 NC 1952-10-11 MISS
34 FORSYTH 000006506450 PHILSON CHERYL MISS NA 1060 NA N CLAREMONT AVE NA NA WINSTON SALEM NC 27101 NA NA F FEMALE 105 NC 1968-09-23 MISS
6 AVERY 000000002024 ATKINS DAVID GLEN MR NA 0 NA NA BIRD ST NA NA NEWLAND NC 28657 NA NA M MALE 47 NC 1984-09-12 MR
34 FORSYTH 000005004360 LIVENGOOD THURMOND MS NA 46 NA NA ROBINHOOD CIR NA NA WINSTON SALEM NC 27106 NA NA F FEMALE 105 NC 1960-10-18 MS
28 DARE 000000733602 STINTZI MANDI LY NN NA 3137 NA NA RAYMOND AVE NA NA KILL DEVIL HILLS NC 27948 NA NA F FEMALE 30 WI 2001-05-31 NN
19 CHATHAM 000000035665 CRISSMAN JASON LY NN NA 466 NA NA ZEB BROOKS RD NA NA BEAR CREEK NC 27207 919 8378793 M MALE 33 NC 1994-10-13 NN
50 JACKSON 000000009299 GREENE LESTER D(NN) NA 206 NA NA RURAL RTE 1 NA NA NA WHITTIER NC 28789 NA NA M MALE 48 NC 1976-02-23 NN
50 JACKSON 000000016423 LUKER DANIEL B(NN) NA 935 NA NA LUKER BR NA NA TUCKASEGEE NC 28783 828 2939504 M MALE 51 NC 1986-03-07 NN
51 JOHNSTON 000002441360 JOHNSON ROBERT REV NA 912 NA W WALNUT ST NA NA SELMA NC 27576 919 9652673 M MALE 0 NA 1966-05-14 REV
60 MECKLENBURG 000000118296 WORKMAN NATHANIEL REV NA 4638 NA NA HIDDEN VALLEY RD NA NA CHARLOTTE NC 28213 NA NA M MALE 68 NA 1968-10-01 REV
90 UNION 000000173230 ABBAS MOHAMED SR NA 2206 NA NA HONEYCUTT SIMPSON RD NA NA MONROE NC 28110 NA NA M MALE 54 OC 2002-07-30 SR
42 HALIFAX 000000046991 ANSTEAD LENDELL SR NA 41356 NA NA NC HWY 561 NA NA NA HOLLISTER NC 27844 252 5862006 M MALE 35 NA 1996-01-18 SR
42 HALIFAX 000000051888 ANTHONY EVERETT SR NA 22 NA NA LEOPARD LN NA NA SCOTLAND NECK NC 27874 NA NA M MALE 78 NA 1997-07-09 SR
42 HALIFAX 000000000667 ARMSTON MILTON SR NA 446 NA E 2ND ST NA NA HOBGOOD NC 27843 NA NA M MALE 59 NA 1982-04-23 SR
42 HALIFAX 000000052507 ARRINGTON LEROY SR NA 319 NA NA CLEMENT DR NA NA ENFIELD NC 27823 NA NA M MALE 71 NA 1997-10-02 SR
64 NASH 000000063300 BATTLE NATHANIEL SR NA 220 NA NA OVERTON DR NA NA ROCKY MOUNT NC 27804 252 4436003 M MALE 62 NC 2001-02-06 SR
14 CALDWELL 000014480064 BERRY RALPH SR NA 2520 NA NA COLLETTSVILLE RD NA NA LENOIR NC 28645 828 4466330 M MALE 64 NA 2004-09-15 SR
42 HALIFAX 000000036029 BROWN NELSON SR NA 471 NA NA REV ALEX BROWN RD NA NA ROANOKE RAPIDS NC 27870 NA NA M MALE 79 NA 1989-08-29 SR
1 ALAMANCE 000001047400 CARTER FOREST SR NA 2361 NA NA ANGIER TRL NA NA GREEN LEVEL NC 27217 NA 5781785 M MALE 93 NC 1960-05-06 SR
42 HALIFAX 000000049416 CLARK JEFFERY SR NA 210 NA NA LIBERTY ST NA NA ENFIELD NC 27823 252 4455279 M MALE 43 NA 1996-08-24 SR
1 ALAMANCE 000009007830 DEGRAFFENRIED EDWARD (NMN)SR NA 1411 NA N PARK AVE NA NA BURLINGTON NC 27217 NA 2294274 M MALE 75 NC 1984-10-04 SR
32 DURHAM 000030058744 EUBANKS ALBERT SR NA 1915 NA NA SHERMAN AVE NA NA DURHAM NC 27707 NA NA M MALE 58 NC 2004-08-12 SR
27 CURRITUCK 000000003694 HARRIS MARION SR NA 0 NA NA DOWDY’S BAY RD NA NA GRANDY NC 27939 252 4538588 M MALE 74 NC 1952-05-10 SR
69 PAMLICO 000000004795 JOHNSON FRED ALAN SR NA 605 NA NA CHINCHILLA DR NA NA BAYBORO NC 28515 NA NA M MALE 58 KS 1968-10-26 SR
27 CURRITUCK 000000001156 JONES WALTER SR NA 171 NA NA NEWTOWN RD NA NA MOYOCK NC 27958 252 4356558 M MALE 89 NC 1956-05-05 SR
42 HALIFAX 000000052663 LANE LORENZA SR NA 1503 NA NA RAILROAD ST NA NA SCOTLAND NECK NC 27874 252 8265014 M MALE 44 NA 1997-10-06 SR
69 PAMLICO 000000000541 LUPTON DENNIS WAYNE SR NA 66 NA NA OLD LUPTON RD NA NA MERRITT NC 28556 NA NA M MALE 59 WI 1974-06-27 SR
42 HALIFAX 000000002176 LYNCH LOUIS SR NA 28734 NA NA NC HWY 561 NA NA NA ENFIELD NC 27823 NA NA M MALE 69 NA 1974-04-08 SR
54 LENOIR 000000007278 MILLER CLARENCE SR NA 107 NA NA EMMA WEBB PARK DR NA NA KINSTON NC 28501 252 5276713 M MALE 54 NC 1999-06-23 SR
60 MECKLENBURG 000001048790 OSBORNE JOHN SR NA 2501 NA NA HEMBY WOODS DR NA NA CHARLOTTE NC 28262 NA NA M MALE 65 NY 1997-09-27 SR
57 MADISON 000000002840 PERAGINE PAUL SR NA 6217 NA NA PAINT FORK RD NA NA MARS HILL NC 28754 828 6899961 M MALE 48 NY 1998-11-03 SR
1 ALAMANCE 000009033642 SELLARS LARRY SR NA 2215 NA NA MELROSE DR NA NA BURLINGTON NC 27217 NA 5703680 M MALE 54 NC 1992-10-03 SR
78 ROBESON 000000231404 STRICKLAND TIMOTHY SR NA 1294 NA NA HARPERS FERRY RD NA NA MAXTON NC 28364 NA NA M MALE 86 NA 1964-10-17 SR
41 GUILFORD 000000228111 WHITAKER WILLIAM SR NA 2204 NA NA PEAR ST NA NA GREENSBORO NC 27401 NA NA M MALE 82 WV 1952-05-03 SR
69 PAMLICO 000000007117 WHITNEY WILLIAM PRESTON SR NA 1444 NA NA SILVERBROOK RD NA NA ORIENTAL NC 28571 NA NA M MALE 81 NC 1968-10-05 SR
42 HALIFAX 000000049427 WIGGINS MINOR SR NA 613 NA NA GREENWOOD ST NA NA SCOTLAND NECK NC 27874 NA NA M MALE 79 NA 1996-08-31 SR
66 NORTHAMPTON 000000013607 WILLIAMS ERVIN W., SR., NA 0 NA NA ALSTON ST NA NA GARYSBURG NC 27831 252 5362884 M MALE 38 NA 2000-09-20 SR
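
Several rows in the table above match only because punctuation is replaced by spaces before the word-boundary search (for example `HENRY, III.`, `D(NN)` and `(NMN)SR`). The sketch below reproduces that behaviour in base R on a few middle-name values taken from the table. It is an illustration only: `special_words` is a hypothetical four-word subset of the real list, and `first_special_word()` is a hypothetical stand-in for the stringr pipeline used in the analysis.

```r
# Hypothetical subset of the special-word list, for illustration only
special_words <- c("DR", "III", "NN", "SR")
w_regexp <- paste0("\\b", special_words, "\\b", collapse = "|")

# Return the first special word found in each value, or NA if there is none
first_special_word <- function(x) {
  clean <- toupper(x)
  clean <- gsub("[^ A-Z]", " ", clean)        # punctuation and digits become spaces
  clean <- trimws(gsub(" +", " ", clean))     # collapse repeated spaces
  m <- regexpr(w_regexp, clean, perl = TRUE)  # -1 where nothing matches
  out <- rep(NA_character_, length(clean))
  out[m > 0] <- regmatches(clean, m)
  out
}

matches <- first_special_word(
  c("HENRY, III.", "D(NN)", "(NMN)SR", "W., SR.,", "LY NN", "LYNN")
)
matches
```

`LY NN` matches `NN` because the space creates a word boundary, whereas `LYNN` does not match. `(NMN)SR` yields `SR` here because `NMN` is not in the search list, as in the analysis, where `NMN` was excluded from display.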

I inspected the results by eye and dropped from the search list those words that mostly appeared to be valid name components.

Words that appear to be invalid in midl_name, by position:

  • As whole field: AKA, DR, II, III, IV, JR, MD, MISS, MRS, MS, NMN, REV, SR
  • As first word: JR, MRS
  • As last word: DR, II, III, IV, JR, MD, MISS, MR, MRS, NMN, NN, SR
  • As internal word: JR
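
One way the position lists above could be applied is sketched below. This is an assumption about a possible later cleaning step, not something the analysis does: `strip_invalid_words()` is a hypothetical helper, it expects values that are already normalised (upper-case letters separated by single spaces), and it deliberately leaves internal words such as `JR` untouched.

```r
# Word lists copied from the middle-name bullets above
whole_invalid <- c("AKA", "DR", "II", "III", "IV", "JR", "MD", "MISS",
                   "MRS", "MS", "NMN", "REV", "SR")
first_invalid <- c("JR", "MRS")
last_invalid  <- c("DR", "II", "III", "IV", "JR", "MD", "MISS", "MR",
                   "MRS", "NMN", "NN", "SR")

# Hypothetical helper: strip invalid leading/trailing words,
# then blank out fields that consist only of an invalid word.
strip_invalid_words <- function(field, whole, first, last) {
  words <- strsplit(field, " ", fixed = TRUE)
  vapply(words, function(w) {
    # Strip trailing invalid words repeatedly, e.g. "F JR MD" -> "F"
    while (length(w) > 1 && w[length(w)] %in% last) w <- w[-length(w)]
    # Strip leading invalid words repeatedly
    while (length(w) > 1 && w[1] %in% first) w <- w[-1]
    # A single remaining invalid word means the field holds no usable name
    if (length(w) == 1 && w %in% whole) return(NA_character_)
    paste(w, collapse = " ")
  }, character(1))
}

cleaned <- strip_invalid_words(
  c("ALAN SR", "F JR MD", "SR", "LEE", "MRS ANN"),
  whole_invalid, first_invalid, last_invalid
)
cleaned
```

Setting the whole-field cases to `NA` rather than an empty string is a design choice made for this sketch, so that they are treated as missing in later comparisons.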

Timing

Computation time (excl. render): 1092.598 sec elapsed

sessionInfo()
R version 4.0.3 (2020-10-10)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 20.10

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.9.0
LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.9.0

locale:
 [1] LC_CTYPE=en_AU.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_AU.UTF-8        LC_COLLATE=en_AU.UTF-8    
 [5] LC_MONETARY=en_AU.UTF-8    LC_MESSAGES=en_AU.UTF-8   
 [7] LC_PAPER=en_AU.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_AU.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices datasets  utils     methods   base     

other attached packages:
 [1] tidyr_1.1.3      ggplot2_3.3.3    forcats_0.5.1    lubridate_1.7.10
 [5] skimr_2.1.3      stringr_1.4.0    tibble_3.1.0     vroom_1.4.0     
 [9] fs_1.5.0         tictoc_1.0       here_1.0.1       workflowr_1.6.2 
[13] targets_0.3.1   

loaded via a namespace (and not attached):
 [1] Rcpp_1.0.6        ps_1.6.0          assertthat_0.2.1  rprojroot_2.0.2  
 [5] digest_0.6.27     utf8_1.2.1        R6_2.5.0          repr_1.1.3       
 [9] evaluate_0.14     highr_0.8         pillar_1.5.1      rlang_0.4.10     
[13] rstudioapi_0.13   data.table_1.14.0 whisker_0.4       callr_3.6.0      
[17] jquerylib_0.1.3   rmarkdown_2.7     labeling_0.4.2    igraph_1.2.6     
[21] bit_4.0.4         munsell_0.5.0     compiler_4.0.3    httpuv_1.5.5     
[25] xfun_0.22         pkgconfig_2.0.3   base64enc_0.1-3   htmltools_0.5.1.1
[29] tidyselect_1.1.0  bookdown_0.21     codetools_0.2-18  fansi_0.4.2      
[33] crayon_1.4.1      dplyr_1.0.5       withr_2.4.1       later_1.1.0.1    
[37] grid_4.0.3        jsonlite_1.7.2    gtable_0.3.0      lifecycle_1.0.0  
[41] DBI_1.1.1         git2r_0.28.0      magrittr_2.0.1    scales_1.1.1     
[45] cli_2.3.1         stringi_1.5.3     farver_2.1.0      renv_0.13.2      
[49] promises_1.2.0.1  bslib_0.2.4       ellipsis_0.3.1    vctrs_0.3.7      
[53] generics_0.1.0    tools_4.0.3       bit64_4.0.5       glue_1.4.2       
[57] purrr_0.3.4       parallel_4.0.3    processx_3.5.0    yaml_2.2.1       
[61] colorspace_2.0-0  knitr_1.31        sass_0.3.1