# markdown and latex keywords #####
--
.column
.columns
.noframenumbering
align
approx
baselineskip
bcol
bcols
begin
bmp
cbblue
cbred
cdot
center
circle
clip
column
columns
Cor
definecolor
draw
ecol
ecols
ell
em
emp
en
end
exp
fbox
fboxsep
fill
flushright
fontsize
footnotesize
frac
geq
height
help
hfill
hideallsubsections
hline
href
hspace
hyperlink
ibawblue
ibawlink
includegraphics
infty
int
item
itemize
large
Large
ldots
left
Leftrightarrow
leq
lines
lll
llll
mathbb
mathbf
mathcal
mbox
minimum
minipage
myblue
neq
newcommand
newline
normalsize
only
overlay
parskip
phantom
pm
pnorm
pt
qquad
quad
raggedleft
rectangle
renewcommand
rho
right
rightarrow
scriptsize
sd
selectfont
sep
setlength
sim
small
subtitle
tableofcontents
tabular
textasciigrave
textbf
textcolor
textsf
texttt
textwidth
thick
tikzpicture
tikzstyle
times
type
uncover
unit
varepsilon
vfill
vspace
width
Xb
xsep
xshift
xy

# technical terms #####
accuracy
aesthetic
aesthetics
aggregating
analysis
AND
ANN
Anscombe-Quartett
arguments
artificial
BA
backend
backends
bag
bagged
Bagging
Bagging-Modell
base
Bayesian
bins
binwidth
Bitsequenz
Bootstrap
bootstrapped
Bootstrappings
boundaries
boundary
cachen
caching
caret
catalog
causation
character
Character-Vektor
characters
ChatGPT
chunk
chunks
class
classification
Classification-Modell
Clusteranzahl
clustering
Clustering-Algorithmen
Clustering-Methoden
Code-Snippets
coefficient
coefficients
component
confounder
confusion
Connection-Objekt
correlations
csv-Datei
csv-Dateien
CV
Dashboards
data
database
datasplitting
dbplyr
DBs
decision
deep
density
Density-Plot
descent
Dimensionsreduktion
directory
discriminate
dof
downsampling
dplyr
Dunn-Index
dx
EDA
encoding
equation
escape
escaped
escaping
estimation
Execution-Option
explained
expression
expressions
extratrees
facet
faceting
facets
factor
factors
FN
forest
FP
frame
frames
function
gemappt
geom
geoms
git
GitHub
ggplot
GPT
GPTs
gleichverteilt
gradient
grand
greedy
grid
GUI
guide
guides
Help-Pane
Hexbin-Plot
Hexbinplot
hidden
hierarchical
high-bias
hochdimensional
hochdimensionalen
ibawds
imbalance
import
importance
imputer
inference
intercept
Jitter
join
joins
JSON
jsonlite
Key-Value-Pairs
k-means
Klassifizierungsfehler
Klassifizierungsmodell
Klassifizierungsproblem
Klassifizierungsprobleme
Klassifizierungsproblemen
kmeans
kmeans-Cluster
knit
kNN
kNN-Modell
knnImpute
Konfidenzintervall
Kovariation
learning
Legendentitel
linkage
LLM
LLMs
LOESS
Log-Loss
logical
loss
low-variance
machine
MAE
mapping
mappings
markdown
markup
Markup-Sprache
metric
metrics
named
Nicht-Linearität
Niveauparameter
noise
numeric
NZV
observable
odbc
one-hot
options
OR
Out-of-bag
Outcome
Outcomes
overfitted
Oversampling
p-Hacking
package
packages
Pairs-Plot
panel
Parameter-Tuning
Parameter-Tunings
pattern
patterns
perceptron
performance
pipe
pipes
pivoting
pkgs
plot
Plotausgabe
Plotobjekt
precision
preprocessing
preprocessings
pre-trained
prime
principal
QDA
quadratic
Quantilfunktion
Quantilschätzer
Quarto
Quarto-Datei
Quarto-Dateien
Quarto-Dokument
Quarto-Dokumenten
query
queries
random
RDS
recall
Regex
Regex-Tester
Regressionsmodell
Regressionsmodelle
regular
reinforcement
ReLu
repository
Residuenquadrate
Residuenquadrats
reveal.js
revealjs
RMariaDB
RMarkdown
RMarkdown-Dokumenten
RPostgres
rproj
RSQLite
rstudio
RStudio-Add-In
sampling
scaler
scales
scatter
Scatter-Plot
scatterplot
SD
SE
Seed
sensitivity
Setup-Chunk
shape
Shape-Skala
Shapefile
Shapefiles
shapes
significance
silhouette
smoothing
source
specificity
splitting
spurious
SQL
SQLite
squared
standardnormalverteilte
STATA
Statistics
stratification
stratified
Stratifikation
stratifiziere
stringr
summaries
summary
Summary-Funktionen
supervised
template
templates
term
theme
themes
tibble
tibbles
tidy
Tidy-Format
tidying
tidyr
tidyverse
TN
tokenisation
TP
train
Trainingsdaten
transformers
tree
tunen
tuning
undersampling
uniting
unsupervised
upsampling
Urnenziehung
validation
Validierungsdaten
vec
vignette
vignettes
Vorhersagefehler
Vorhersagemodelle
Voronoi-Diagramm
wrangling
XOR
YAML
YAML-Header
ZGWS
Zielvariable
Zielvariablen

# foreign language terms #####
about
across
address
ahead
Ain
and
another
answers
are
area
Armadillo
armed
as
assignments
at
attention
available
average
background
balanced
be
behind
below
best-case
between
binary
black
blue
bots
bottom
by
cabin
carriage
case
ceci
centered
characteristics
cheat
cheatsheet
check
cleaning
code
community
competition
complete
comprehensive
computer
computers
concatenate
conflict
consistent
cookbook
core
corners
cost
create
creating
cross
curve
cutting
cycle
dark
default
democratic
demographic
dentition
device
diamond
dictionaries
dictionary
different
display
distance
does
dotted
each
easy
edge
either
English
error
est
every
execute
experience
exploratory
factbook
faithful
fallacy
false
feed
fertility
figure
filtering
for
free
from
general
generate
graditional
graphics
green
grey
gun
have
header
history
humans
if
image
imply
improves
incidentally
index
infinity
inner
intelligence
internals
introduction
is
Issue
its
label
labs
language
layer
layern
learn
line
look
majority
manipulate
marital
matching
math
mathematics
mean
meant
measure
measured
method
mind
mixed
model
modeling
modelling
Modify
mu
murders
mutating
need
network
new
node
not
number
object
objects
observation
observational
of
old
on
open
opt-out
or
ou
overview
pace
paint
pas
path
people
People
physicists
playground
points
polls
practical
practices
prediction
predictions
predictive
probability
program
programs
project
projects
proxy
questionable
questions
quote
radius
read
reduction
refine
reports
republic
research
respect
root
rounded
row
rule
said
samples
science
scientist
Seals
search
section
sections
seed
sentences
separating
sequences
settings
setup
sheets
shift
short
should
similar
similarity-statement
single
slide
slides
snake
some
species
specific
specifications
split
spread
spreads
state
states.
statistic
stopping
structure
Styling
sum
Swiss
table
tasks
teachable
text
the
they
third-cause
This
thread
threads
tiny
to
top
traditional
transferable
transforming
trees
trim
true
two
under
une
university
urn
use
validated
value
very
visualisation
visualising
Visualization
visualize
Walrus
weights
what
white
with
workflow
working
world
worst-
yet
you
your


# file endings #####
.csv
.jpg
.json
.parquet
.png
.R
.rds
.Rmd
.qmd
jpg
png
Rmd
rmd
qmd


# file names (or parts of them) #####
approach.png
cats.png
ChangRGraphicsCookbook.jpg
ChangRGraphicsCookbook.png
data-science-wrangle.png
dataset.png
descent.png
dimensions.jpg
double-zero-roulette.png
eda-boxplot.png
file.png
geron
headers.png
InferenceAndModelingRequirements.png
IrizarryDataScience.png
knit.png
lists.png
logo.jpeg
logo.png
LovelaceGeocompR.png
options.png
out.png
pics
ProbabilityRequirements.png
process.png
project.png
properties.png
ranger.png
RCoreIntroduction.png
regression.png
reinforcement.png
RInternals.png
rmarkdownflow.png
rmd.png
roc
RStudio.png
settings.png
significant.png
sl.jpg
styler-add-in.png
text.png
tidy.png
type.png
unlabelled.png
wd.png
WickhamAdvancedR.png
Wickhamggplot
WickhamRDataScience.png
WickhamRPackages.png
YihuiRMarkdown.png

# proper names #####
AlphaGo
America
Andersons
Bache
Bayes
Bell
BKW
Brewer
Caribbean
Chang
Christof
CIFAR-
Coale
ColorBrewer
Congo
ConvNetJS
Cynthia
Doug
Dunn
ECMA
ECMA-
Ella
Excel
Farcaster
Fred
Galton
Galtons
Gapminder
Garrett
Git
Grolemund
Géron
Hadley
IEC
Irizarry
JasonAizkalns
Kaggle
Keras
Knuth
Lanz
Lovelace
Magritte
McCulloch
McIlroy
Milton
Mitchell
MNIST
Overflow
Pitts
Posit
PubMed
PyTorch
pytorch
RColorBrewer
RStudio
Schüpbach
Scikit-Learn
Stack
TensorFlow
tensorflow
Torch
UCI
UK
Ward
Wickham
Xie
Yellowstone
Yihui

# other #####
-
-Achse
-Datei
-Daten
-dimensional
-fold
-Gemeinden
-Gruppe
-ige
-iger
-Installation
-Konfidenzintervall
-Konfidenzintervalle
-Modell
-Objekt
-Operator
-p
-Package
-Quantil
-Quantile
-Schleife
-seitigen
-t
-te
-Tab
-ten
-Verteilung
-Wert
BY-SA
Carnivora
CC
csv-
ct
cu.in.
ddr
Domänenwissen
dt.
Embarked
Excel-
FOSS
gallon
geclusterten
gefittet
gefitteten
gematched
gematcht
Go-Engine
gut-
Hands-On
hilfreicheren
hintereinandergereiht
hp
HTML-
id
ID
Ig-Nobelpreis
inch
Inputdaten
Ja-
k-
Kaugummigeschmack
lb
matchen
matchende
matchenden
matcht
max
mile
min
n-
opendata.swiss
Pannel
PassengerId
performt
PhD
pop
Präsenztag
Präsenztage
Präsenztagen
Pönale
Reminder
residuen-
Resultatvektors
sche
Schliffqualität
Shortcuts
TB-Fälle
Training-
ty
Uploadlink
up-
Vorlesungsfolien
Wikibooks
www.auto-data.net
x-
ÖV
überfittet

# labels for cross references #####
bestimmtheitsmass-r
erwartungswert
erwartungswert-und-standardabweichung-im-urnenmodell
galtons-daten-mit-regressionsgeraden
normalverteilung
rechenregeln-fuxfcr-erwartungswert-und-standardabweichung
schuxe
sse-des-sohnes-aus-dem-linearen-modell
standardabweichung
standardfehler-der-regression-
tzer-fuxfcr-erwartungswert-und-standardabweichung
tzung-der-gruxf
uxfcbung-korrelation-regressionsgerade
uxfcbung-zgws-

# word pieces because of formulations like **d**igits #####
bv
cr
earest
eighbours
esults
fter
igits
nown
ord
ypothesising
