Open Source Software and Open Data
International Trade
- Tradestatistics: Open trade Statistics API
wrapper and utility program.
- Wbstats: An R package for searching
and downloading data from the World Bank API
Econometrics
- Capybara: Efficient
Fixed-Effects Estimation in R With C++ 11 Backend.
- Gravity: Estimation methods for gravity models.
- Tradepolicy: Replication of an advanced guide
to trade policy analysis.
R and C++ bindings
- Cpp4r: A collection of C++ bindings for R that
defaults to C++11 and uses newer C++ features if those are supported.
Linear algebra
- Armadillo4r: Provides function declarations
and inline function definitions that facilitate communication between R and the Armadillo C++ library for
linear algebra and scientific computing. Significantly expands cpp11armadillo by providing support for
complex numbers and uses C++11 (and newer) features if available.
- Cpp11armadillo: Provides function declarations
and inline function definitions that facilitate communication between R and the Armadillo C++ library for
linear algebra and scientific computing.
Data visualization
- D3po: A set of opinionated templates for quick data
visualization using D3.js and R. It is fully compatible with RMarkdown and Shiny, and it is available
under the Apache 2.0 license for use in commercial and non-commercial projects.
REDATAM format
- Open REDATAM (C++/bash): Open Redatam is an open source
software for extracting raw information from REDATAM databases. It was created to recover information
of REDATAM databases for statistical analysis using standard tools such as SPSS, STATA, R, etc.
- Redatam for R: Read REDATAM binary data directly
in R. This is similar to the Haven package for SPSS/Stata datasets but for REDATAM formats.
- Redatam for Python: Read REDATAM binary data directly
in R. This is similar to the Haven package for SPSS/Stata datasets but for REDATAM formats.
Data validation
Pointblank: Data validation and organization of
metadata for local and remote tables.
Working with PDFs
- TabulaPDF: R bindings to the
Tabula Java library for PDF tables extraction.
- Cpp11tesseract: Bindings to Tesseract, a
powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is
highly configurable in order to tune the detection algorithms and obtain the best possible results.
- Cpp11poppler: Bindings to Poppler, a tool
for extracting text, fonts, attachments and metadata from a PDF file. It also supports high quality
rendering of PDF documents into PNG, JPEG, TIFF format, or into raw bitmap vectors for posterior
processing.
- Cpp11qpdf: Bindings to Qpdf, an open-source
PDF rendering library that allows to conduct content-preserving transformations of PDF files such as
split, combine, and compress PDF files.