Open source
Selected R packages
- TabulaPDF: R bindings to the
Tabula Java library for PDF tables extraction.
- Redatam: Read REDATAM binary data directly
in R. This is similar to the Haven package for SPSS/Stata datasets but for REDATAM formats.
- Cpp11tesseract: R bindings to the
Tesseract C++ library for optical character recognition (OCR).
- Freedomhouse: A tidy version of Freedom House
datasets with added ISO country codes and texts with the sub-item justification.
- Capybara: Efficient
Fixed-Effects Estimation in R With C++ 11 Backend.
- Cpp11armadillo: Provides function declarations
and inline function definitions that facilitate communication between R and the Armadillo C++ library for
linear algebra and scientific computing.
- Cpp11eigen: Provides function declarations
and inline function definitions that facilitate communication between R and the Eigen C++ library for
linear algebra and scientific computing.
- DESTA (R version): The Design of Trade Agreements
Database.
- USITC Gravity: Database Adapted From the
International Trade and Production Database for Estimation (ITPD-E) and Dynamic Gravity Dataset (DGD).
- Gravity: Estimation methods for gravity models.
- Tradepolicy: Replication of an advanced guide
to
trade policy analysis.
- Tradestatistics: Open trade statistics API
wrapper and utility program.
- Pointblank: Data validation and organization of
metadata for local and remote tables.
CRAN
I maintain the WebTechnologies view
and
the official mirror for Chile.
LaTeX
I created the R package varsityblues, which allows
you
to write assignments, presentations and thesis in RStudio by following UofT formatting. This allows the user
to
avoid copy-pasting tables or plots from R outputs into Word/LaTeX documents. Instead, it allows the user to
work
with notebooks that they can export to PDF, and it configures the LaTeX setup automatically.
This package is based on the LaTeX style files that were sent to me by the late Professor Kim C. Border.
Cloud computing
I created images for RStudio Server and RStudio Server + Kubernetes,
which
allows you to create virtual machines on DigitalOcean and to
skip
the setup time to install the Tidyverse and other packages. With these images, it takes around 30 seconds to
have a ready to go setup for Data Science. This combines well with analogsea as I explained in my blog.
In simple terms, this allows you to rent a supercomputer, for example, with 48 cores and 200 GB in RAM, for a
reasonable price and minimal waiting.