
igraph - Network Analysis and Visualization
Routines for simple graphs and network analysis. It can handle large graphs very well and provides functions for generating random and regular graphs, graph visualization, centrality methods and much more.
Last updated 2 months ago
complex-networksgraph-algorithmsgraph-theorymathematicsnetwork-analysisnetwork-graphfortranlibxml2glpkopenblascpp
21.01 score 584 stars 1.9k dependents 31k scripts 543k downloads
rlang - Functions for Base Types and Core R and 'Tidyverse' Features
A toolbox for working with base types, core R features like the condition system, and core 'Tidyverse' features like tidy evaluation.
Last updated 3 months ago
19.98 score 517 stars 15k dependents 9.4k scripts 1.9M downloads
arrow - Integration to 'Apache' 'Arrow'
'Apache' 'Arrow' <https://arrow.apache.org/> is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. This package provides an interface to the 'Arrow C++' library.
Last updated 2 months ago
arrowparquetcurlopensslcpp
19.21 score 15k stars 83 dependents 10k scripts 324k downloadslater - Utilities for Scheduling Functions to Execute Later with Event Loops
Executes arbitrary R or C functions some time after the current time, after the R execution stack has emptied. The functions are scheduled in an event loop.
Last updated 4 months ago
cpp
15.76 score 143 stars 2.6k dependents 232 scripts 610k downloadsprophet - Automatic Forecasting Procedure
Implements a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. Prophet is robust to missing data and shifts in the trend, and typically handles outliers well.
Last updated 6 months ago
forecastingpythoncpp
15.53 score 19k stars 14 dependents 976 scripts 15k downloads
duckdb - DBI Package for the DuckDB Database Management System
The DuckDB project is an embedded analytical data management system with support for the Structured Query Language (SQL). This package includes all of DuckDB and an R Database Interface (DBI) connector.
Last updated 20 days ago
databaseduckdbolapcpp
13.79 score 159 stars 46 dependents 1.7k scripts 30k downloadslitedown - A Lightweight Version of R Markdown
Render R Markdown to Markdown (without using 'knitr'), and Markdown to lightweight HTML or 'LaTeX' documents with the 'commonmark' package (instead of 'Pandoc'). Some missing Markdown features in 'commonmark' are also supported, such as raw HTML or 'LaTeX' blocks, 'LaTeX' math, superscripts, subscripts, footnotes, element attributes, and appendices, but not all 'Pandoc' Markdown features are (or will be) supported. With additional JavaScript and CSS, you can also create HTML slides and articles. This package can be viewed as a trimmed-down version of R Markdown and 'knitr'. It does not aim at rich Markdown features or a large variety of output formats (the primary formats are HTML and 'LaTeX'). Book and website projects of multiple input documents are also supported.
Last updated 1 months ago
litedownmarkdownr-markdownreport-generatorreproducible-research
13.34 score 201 stars 177 dependents 14 scripts 122k downloads
cmdstanr - R Interface to 'CmdStan'
A lightweight interface to 'Stan' <https://mc-stan.org>. The 'CmdStanR' interface is an alternative to 'RStan' that calls the command line interface for compilation and running algorithms instead of interfacing with C++ via 'Rcpp'. This has many benefits including always being compatible with the latest version of Stan, fewer installation errors, fewer unexpected crashes in RStudio, and a more permissive license.
Last updated 4 days ago
bayesbayesianmarkov-chain-monte-carlomaximum-likelihoodmcmcstanvariational-inference
12.65 score 145 stars 9 dependents 5.2k scripts
targets - Dynamic Function-Oriented 'Make'-Like Declarative Pipelines
Pipeline tools coordinate the pieces of computationally demanding analysis projects. The 'targets' package is a 'Make'-like pipeline tool for statistics and data science in R. The package skips costly runtime for tasks that are already up to date, orchestrates the necessary computation with implicit parallel computing, and abstracts files as R objects. If all the current output matches the current upstream code and data, then the whole pipeline is up to date, and the results are more trustworthy than otherwise. The methodology in this package borrows from GNU 'Make' (2015, ISBN:978-9881443519) and 'drake' (2018, <doi:10.21105/joss.00550>).
Last updated 2 months ago
data-sciencehigh-performance-computingmakepeer-reviewedpipeliner-targetopiareproducibilityreproducible-researchtargetsworkflow
12.50 score 973 stars 22 dependents 13k downloadspolars - Lightning-Fast 'DataFrame' Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Last updated 16 days ago
arrowpolarsrust
12.01 score 501 stars 2 dependents 1.0k scripts 13 downloads
mirai - Minimalist Async Evaluation Framework for R
Designed for simplicity, a 'mirai' evaluates an R expression asynchronously in a parallel process, locally or distributed over the network. The result is automatically available upon completion. Modern networking and concurrency, built on 'nanonext' and 'NNG' (Nanomsg Next Gen), ensures reliable and efficient scheduling over fast inter-process communications or TCP/IP secured by TLS. Distributed computing can launch remote resources via SSH or cluster managers. An inherently queued architecture handles many more tasks than available processes, and requires no storage on the file system. Innovative features include support for otherwise non-exportable reference objects, event-driven promises, and asynchronous parallel map.
Last updated 14 days ago
asyncasynchronous-tasksconcurrencydistributed-computinghigh-performance-computingparallel-computing
11.86 score 217 stars 7 dependents 120 scripts 4.0k downloadsnanoarrow - Interface to the 'nanoarrow' 'C' Library
Provides an 'R' interface to the 'nanoarrow' 'C' library and the 'Apache Arrow' application binary interface. Functions to import and export 'ArrowArray', 'ArrowSchema', and 'ArrowArrayStream' 'C' structures to and from 'R' objects are provided alongside helpers to facilitate zero-copy data transfer among 'R' bindings to libraries implementing the 'Arrow' 'C' data interface.
Last updated 6 months ago
cpp
11.61 score 185 stars 27 dependents 37 scripts 21k downloadsadbcdrivermanager - 'Arrow' Database Connectivity ('ADBC') Driver Manager
Provides a developer-facing interface to 'Arrow' Database Connectivity ('ADBC') for the purposes of driver development, driver testing, and building high-level database interfaces for users. 'ADBC' <https://arrow.apache.org/adbc/> is an API standard for database access libraries that uses 'Arrow' for result sets and query parameters.
Last updated 1 months ago
cpp
11.32 score 415 stars 6 dependents 71 scripts 2.5k downloads
crew - A Distributed Worker Launcher Framework
In computationally demanding analysis projects, statisticians and data scientists asynchronously deploy long-running tasks to distributed systems, ranging from traditional clusters to cloud services. The 'NNG'-powered 'mirai' R package by Gao (2023) <doi:10.5281/zenodo.7912722> is a sleek and sophisticated scheduler that efficiently processes these intense workloads. The 'crew' package extends 'mirai' with a unifying interface for third-party worker launchers. Inspiration also comes from packages. 'future' by Bengtsson (2021) <doi:10.32614/RJ-2021-048>, 'rrq' by FitzJohn and Ashton (2023) <https://github.com/mrc-ide/rrq>, 'clustermq' by Schubert (2019) <doi:10.1093/bioinformatics/btz284>), and 'batchtools' by Lang, Bischel, and Surmann (2017) <doi:10.21105/joss.00135>.
Last updated 2 months ago
high-performance-computing
11.06 score 137 stars 2 dependents 240 scripts 1.9k downloadshttpgd - A 'HTTP' Server Graphics Device
A graphics device for R that is accessible via network protocols. This package was created to make it easier to embed live R graphics in integrated development environments and other applications. The included 'HTML/JavaScript' client (plot viewer) aims to provide a better overall user experience when dealing with R graphics. The device asynchronously serves graphics via 'HTTP' and 'WebSockets'.
Last updated 1 months ago
cpp
10.90 score 428 stars 1 dependents 284 scripts 4.0k downloadswebchem - Chemical Information from the Web
Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.
Last updated 2 years ago
cas-numberchemical-informationchemspideridentifierropensciwebscraping
10.27 score 167 stars 10 dependents 165 scripts 1.1k downloads
nanonext - NNG (Nanomsg Next Gen) Lightweight Messaging Library
R binding for NNG (Nanomsg Next Gen), a successor to ZeroMQ. NNG is a socket library for reliable, high-performance messaging over in-process, IPC, TCP, WebSocket and secure TLS transports. Implements 'Scalability Protocols', a standard for common communications patterns including publish/subscribe, request/reply and service discovery. As its own threaded concurrency framework, provides a toolkit for asynchronous programming and distributed computing. Intuitive 'aio' objects resolve automatically when asynchronous operations complete, and synchronisation primitives allow R to wait upon events signalled by concurrent threads.
Last updated 16 days ago
concurrencyhttpsipc-messagemessaging-librarynngrpcsocket-communicationsynchronization-primitivestcp-protocolwebsocketmbedtls
9.84 score 60 stars 9 dependents 32 scripts 5.7k downloads
tarchetypes - Archetypes for Targets
Function-oriented Make-like declarative pipelines for Statistics and data science are supported in the 'targets' R package. As an extension to 'targets', the 'tarchetypes' package provides convenient user-side functions to make 'targets' easier to use. By establishing reusable archetypes for common kinds of targets and pipelines, these functions help express complicated reproducible pipelines concisely and compactly. The methods in this package were influenced by the 'targets' R package. by Will Landau (2018) <doi:10.21105/joss.00550>.
Last updated 2 months ago
data-sciencehigh-performance-computingpeer-reviewedpipeliner-targetopiareproducibilitytargetsworkflow
9.09 score 143 stars 10 dependents 3.4k downloads
prqlr - R Bindings for the 'prqlc' Rust Library
Provides a function to convert 'PRQL' strings to 'SQL' strings. Combined with other R functions that take 'SQL' as an argument, 'PRQL' can be used on R.
Last updated 6 days ago
prqlrustsql
9.07 score 56 stars 18 scripts 23k downloads
unigd - Universal Graphics Device
A unified R graphics backend. Render R graphics fast and easy to many common file formats. Provides a thread safe 'C' interface for asynchronous rendering of R graphics.
Last updated 1 months ago
cairotifflibpngzlibcpp
8.01 score 23 stars 2 dependents 6 scripts 3.6k downloads
tidypolars - Get the Power of Polars with the Syntax of the Tidyverse
Polars is a cross-language tool for manipulating very large data. However, one drawback is that the R implementation has a syntax that will look odd to many R users who are not used to Python syntax. The objective of tidypolars is to improve the ease-of-use of Polars in R by providing tidyverse syntax to polars.
Last updated 24 days ago
7.86 score 198 stars 30 scriptsadbcflightsql - 'Arrow' Database Connectivity ('ADBC') 'FlightSQL' Driver
Provides a developer-facing interface to the 'Arrow' Database Connectivity ('ADBC') 'FlightSQL' driver for the purposes of building high-level database interfaces for users. 'ADBC' <https://arrow.apache.org/adbc/> is an API standard for database access libraries that uses 'Arrow' for result sets and query parameters.
Last updated 1 months ago
golang
7.81 score 415 stars 2 scriptsadbcsnowflake - Arrow Database Connectivity ('ADBC') 'Snowflake' Driver
Provides a developer-facing interface to the 'Arrow' Database Connectivity ('ADBC') 'Snowflake' driver for the purposes of building high-level database interfaces for users. 'ADBC' <https://arrow.apache.org/adbc/> is an API standard for database access libraries that uses 'Arrow' for result sets and query parameters.
Last updated 1 months ago
golang
7.81 score 415 stars 1 scriptsadbcbigquery - 'Arrow' Database Connectivity ('ADBC') 'BigQuery' Driver
Provides a developer-facing interface to the 'Arrow' Database Connectivity ('ADBC') 'BigQuery' driver for the purposes of building high-level database interfaces for users. 'ADBC' <https://arrow.apache.org/adbc/> is an API standard for database access libraries that uses 'Arrow' for result sets and query parameters.
Last updated 1 months ago
golang
7.81 score 415 stars 1 scripts
ichimoku - Visualization and Tools for Ichimoku Kinko Hyo Strategies
An implementation of 'Ichimoku Kinko Hyo', also commonly known as 'cloud charts'. Static and interactive visualizations with tools for creating, backtesting and development of quantitative 'ichimoku' strategies. As described in Sasaki (1996, ISBN:4925152009), the technique is a refinement on candlestick charting, originating from Japan and now in widespread use in technical analysis worldwide. Translating as 'one-glance equilibrium chart', it allows the price action and market structure of financial securities to be determined 'at-a-glance'. Incorporates an interface with the OANDA fxTrade API <https://developer.oanda.com/> for retrieving historical and live streaming price data for major currencies, metals, commodities, government bonds and stock indices.
Last updated 20 days ago
ichimokuichimoku-cloudoandaquantitative-finance
7.70 score 31 stars 32 scripts 1.6k downloads
jagstargets - Targets for JAGS Pipelines
Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the 'jagstargets' R package is leverages 'targets' and 'R2jags' to ease this burden. 'jagstargets' makes it super easy to set up scalable JAGS pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than 'targets' alone. For the underlying methodology, please refer to the documentation of 'targets' <doi:10.21105/joss.02959> and 'JAGS' (Plummer 2003) <https://www.r-project.org/conferences/DSC-2003/Proceedings/Plummer.pdf>.
Last updated 4 months ago
bayesianhigh-performance-computingjagsmaker-targetopiareproducibilityrjagsstatisticstargetscpp
6.95 score 10 stars 32 scripts 521 downloads
rwasm - Build R Packages for WebAssembly
Utility functions to help build R packages compiled for WebAssembly (Wasm), manage Wasm binary R package libraries and repositories, and prepare webR compatible filesystem images for static web hosting of data files and R packages.
Last updated 7 months ago
6.93 score 61 starsstring2path - Rendering Font into 'data.frame'
Extract glyph information from font data, and translate the outline curves to flattened paths or tessellated polygons. The converted data is returned as a 'data.frame' in easy-to-plot format.
Last updated 9 days ago
rustcargo
6.57 score 99 stars 11 scripts 4.0k downloads
geographr - R package for mapping UK geographies
A package to distribute and compute on UK geographical data.
Last updated 8 months ago
6.36 score 38 stars 402 scriptsbridgestan - BridgeStan, Accessing Stan Model Functions in R
BridgeStan provides efficient in-memory access to the methods of a Stan model, including log densities, gradients, Hessians, and constraining and unconstraining transforms.
Last updated 3 months ago
ccppjuliapythonstan
5.89 score 97 stars 8 scripts
stantargets - Targets for Stan Workflows
Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the 'stantargets' R package leverages 'targets' and 'cmdstanr' to ease these burdens. 'stantargets' makes it super easy to set up scalable Stan pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than 'targets' alone. 'stantargets' can access all of 'cmdstanr''s major algorithms (MCMC, variational Bayes, and optimization) and it supports both single-fit workflows and multi-rep simulation studies. For the statistical methodology, please refer to 'Stan' documentation (Stan Development Team 2020) <https://mc-stan.org/>.
Last updated 4 months ago
bayesianhigh-performance-computingmaker-targetopiareproducibilitystanstatisticstargets
5.42 score 49 stars 180 scriptstaxizedb - Tools for Working with 'Taxonomic' Databases
Tools for working with 'taxonomic' databases, including utilities for downloading databases, loading them into various 'SQL' databases, cleaning up files, and providing a 'SQL' connection that can be used to do 'SQL' queries directly or used in 'dplyr'.
Last updated 2 years ago
itistaxizetaxonomic-databasestaxonomy
5.40 score 31 stars 1 dependents 90 scripts 59 downloadsSBC - Simulation Based Calibration for rstan/cmdstanr models
SBC helps perform Simulation Based Calibration on Bayesian models. SBC lets you check for bugs in your model code and/or algorithm that fits the model. SBC focuses on models built with 'Stan' <https://mc-stan.org>, but can support other modelling languages as well.
Last updated 1 years ago
diagnosticssimulation-based-inference
5.08 score 56 stars 86 scripts
crew.aws.batch - A Crew Launcher Plugin for AWS Batch
In computationally demanding analysis projects, statisticians and data scientists asynchronously deploy long-running tasks to distributed systems, ranging from traditional clusters to cloud services. The 'crew.aws.batch' package extends the 'mirai'-powered 'crew' package with a worker launcher plugin for AWS Batch. Inspiration also comes from packages 'mirai' by Gao (2023) <https://github.com/shikokuchuo/mirai>, 'future' by Bengtsson (2021) <doi:10.32614/RJ-2021-048>, 'rrq' by FitzJohn and Ashton (2023) <https://github.com/mrc-ide/rrq>, 'clustermq' by Schubert (2019) <doi:10.1093/bioinformatics/btz284>), and 'batchtools' by Lang, Bischl, and Surmann (2017). <doi:10.21105/joss.00135>.
Last updated 2 months ago
aws-batchcrewhigh-performance-computing
4.99 score 15 stars 735 downloads
healthyr - R package for mapping UK health data
A package to distribute and summarise on UK health data.
Last updated 7 months ago
4.61 score 4 stars 103 scripts
asylum - Data on Asylum and Resettlement for the UK
Data on Asylum and Resettlement for the UK, provided by the Home Office <https://www.gov.uk/government/statistical-data-sets/immigration-system-statistics-data-tables>.
Last updated 1 years ago
4.21 score 3 stars 36 scripts 663 downloads
demographr - R package for mapping UK demographics
A package to distribute UK demographic data.
Last updated 6 months ago
4.08 score 2 stars 67 scriptsglaredb - R Bindings for 'GlareDB'
R bindings for 'GlareDB', an analytical database based on 'Apache Arrow' and 'Apache DataFusion' that can connect to various data sources and execute queries.
Last updated 6 months ago
arrowglaredbrustsql
3.68 score 3 stars 6 scripts
milRex - Query Military Expenditure Data From SIPRI
Queries data from the Military Expenditure Database by SIPRI.
Last updated 5 months ago
3.60 score 2 stars 1 scriptstargetsketch - Create and Visualize Targets Pipelines
A Shiny app for creating and visualizing targets pipelines (<https://github.com/wlandau/targets>). Simply supply a _targets.R file with the pipeline and supporting functions, and the app shows a dependency graph and manifest.
Last updated 1 years ago
data-sciencehigh-performance-computingpipelinereproducibilityshinytargetsworkflow
3.41 score 17 stars 1 scriptsmultiverse.internals - Internal Infrastructure for R-multiverse
R-multiverse requires this internal infrastructure package to automate contribution reviews and populate universes.
Last updated 18 days ago
3.30 score 1 stars 1 scriptswebseq - Access data from biological sequence databases like NCBI, ENA, MGnify
This package interacts with online biological sequence databases. It provides functions to search for sequences, convert identifiers and download sequences and associated metadata.
Last updated 1 years ago
3.18 score 3 stars 1 scripts
loneliness - A Loneliness Prescription Index for the UK
A loneliness prescription index for the UK.
Last updated 1 years ago
2.71 score 2 stars 17 scriptsaudio.vadwebrtc - Voice Activity Detection using the 'webrtc' Toolkit
Voice Activity Detection using the 'webrtc' toolkit. Identify the locations in audio files where there is an active voice. The is done based on a Gaussian Mixture Model implemented in the 'webrtc' framework.
Last updated 11 months ago
cpp
1.70 score 1 stars 6 scriptstidytensor - TidyTensor
Provides functions for working with tensors (vectors, matrices, or multidimensional arrays) as named hierarchical data structures.
Last updated 3 years ago
2.30 score 4 stars 1 scripts