Emily Riederer
Emily Riederer
About
Top Posts
Recent Posts
Talks
Projects
Publications
Tags
Posts
The Art of Abstraction in ETL: Dodging Data Extraction Errors
Cross-post from guest post on Airbyte’s developer blog
Mar 22, 2023
Goin' to Carolina in my mind (or on my hard drive)
Out-of-memory processing of North Carolina’s voter file with DuckDB and Apache Arrow
Sep 25, 2022
data
,
sql
Oh, I'm sure it's probably nothing
How we do (or don’t) think about null values and why the polyglot push makes it all the more important
Sep 5, 2022
rstats
,
python
,
sql
,
data
Update: grouped data quality check PR merged to dbt-utils
After a prior post on the merits of grouped data quality checks, I demo my newly merged implementation for dbt
Aug 26, 2022
data
,
changelog
,
dbt
Using databases with Shiny
Key issues when adding persistent storage to a Shiny application, featuring {golem} app development and Digital Ocean serving
Jan 2, 2022
rstats
,
shiny
,
data
How to Make R Markdown Snow
Much like ice sculpting, applying powertools to absolutely frivolous pursuits
Dec 11, 2021
rstats
,
rmarkdown
Make grouping a first-class citizen in data quality checks
Which of these numbers doesn’t belong? -1, 0, 1, NA. You can’t judge data quality without data context, so our tools should enable as much context as possible.
Nov 27, 2021
data
Why machine learning hates vegetables
A personal encounter with ‘intelligent’ data products gone wrong
Nov 10, 2021
data-disasters
Update: column-name contracts with dbtplyr
Following up on ‘Embedding Column-Name Contracts… with dbt’ to demo my new dbtplyr package to further streamline the process
Sep 21, 2021
data
,
changelog
,
dbt
A lightweight data validation ecosystem with R, GitHub, and Slack
A right-sized solution to automated data monitoring, alerting, and reporting using R (
pointblank
,
projmgr
), GitHub (Actions, Pages, issues), and Slack
Aug 26, 2021
rstats
,
data
»
Cite
×