14  Open Source Software

This is a list of the open source software that USACE geospatial data scientists depend on. All of this software is best-of-breed and can be used at no cost with no proprietary lock-in. Use this page to explore options for skill development on your projects.

Status Check (USACE users only)

14.1 Languages

Miniforge Python Distribution

  • Definition: A lightweight conda distribution that installs only the conda package manager and the conda-forge channel by default.
  • Purpose: Bootstrap a minimal, reproducible Python environment for non-geospatial projects without the overhead of a full Anaconda installation.
  • Audience: Data scientists who need to quickly set up a clean Python environment.
  • URL: conda-forge.org

Node.js

  • Definition: A cross-platform JavaScript runtime built on Chrome’s V8 engine that executes JavaScript outside a browser.
  • Purpose: Build and run JavaScript-based full-stack web applications and developer tooling.
  • Audience: Developers building web applications or working with JavaScript-based data visualization tools.
  • URL: nodejs.org

R

  • Definition: A language and environment for statistical computing and graphics, maintained by the R Core Team.
  • Purpose: Perform statistical analysis, data manipulation, and visualization; author reproducible reports and Shiny applications.
  • Audience: All USACE geospatial data scientists using R-based workflows.
  • URL: r-project.org

RTools

  • Definition: A collection of build tools required to compile R packages from source on Windows.
  • Purpose: Install R packages that require compilation, including those with C, C++, or Fortran code.
  • Audience: R users on Windows who need to build packages from source.
  • URL: cran.r-project.org/bin/windows/Rtools

14.2 IDEs

VS Code

  • Definition: A source-available code editor produced by Microsoft, built on the MIT-licensed Code - OSS project, with broad language and extension support.
  • Purpose: Write, debug, and manage code across multiple languages using a lightweight, highly extensible editor with a large extension ecosystem.
  • Audience: Developers and data scientists who need a general-purpose editor that supports Python, JavaScript, R, and other languages beyond what Positron or RStudio cover.
  • URL: code.visualstudio.com

Positron

  • Definition: A VS Code-based integrated development environment (IDE) designed specifically for data science workflows.
  • Purpose: Write, run, and debug R and Python code in a unified IDE with first-class support for both languages.
  • Audience: Data scientists who work across R and Python and want a modern, extensible IDE.
  • URL: positron.posit.co

RStudio Desktop

  • Definition: An open source IDE built for the R language, produced by Posit.
  • Purpose: Write and execute R code, manage projects, and author Quarto and R Markdown documents in a dedicated R environment.
  • Audience: Data scientists primarily working in R who prefer a purpose-built R IDE.
  • URL: posit.co/products/open-source/rstudio

14.3 Version Control

Git for Windows

  • Definition: A Windows port of the Git distributed version control system, bundled with Git Bash, Git GUI, and Git Credential Manager.
  • Purpose: Track code changes, manage branches, sign commits, and collaborate on repositories hosted on GitHub or other Git platforms.
  • Audience: All USACE geospatial data scientists who maintain code in Git repositories.
  • URL: gitforwindows.org

14.4 Databases

PostgreSQL

  • Definition: An open source object-relational database system with over 35 years of active development.
  • Purpose: Store, query, and manage structured geospatial and tabular data using SQL; extend with PostGIS for spatial data support.
  • Audience: Data scientists and analysts who need a robust, production-grade relational database.
  • URL: postgresql.org

pgAdmin

  • Definition: An open source administration and development platform for PostgreSQL databases.
  • Purpose: Administer PostgreSQL servers, design schemas, write and run SQL queries, and inspect database objects through a graphical interface.
  • Audience: Data scientists and database administrators who work with PostgreSQL.
  • URL: pgadmin.org

14.5 Containers

Podman Desktop

  • Definition: An open source, daemonless container engine and desktop GUI for building and running OCI containers.
  • Purpose: Build and deploy data science applications and toolchains inside Linux containers without requiring root privileges.
  • Audience: Data scientists who need reproducible, portable computing environments that work consistently across machines.
  • URL: podman.io

14.6 GIS

QGIS

  • Definition: An open source geographic information system (GIS) platform for viewing, editing, and analyzing spatial data.
  • Purpose: Perform GIS analysis, visualize spatial datasets, and build map products without proprietary software overhead.
  • Audience: GIS analysts who need a full-featured desktop GIS, and advanced analysts who use the QGIS Python API for scripting.
  • URL: qgis.org

14.7 Publishing & Sharing

Quarto

  • Definition: An open source technical publishing system that supports Python, R, Julia, and JavaScript.
  • Purpose: Author reproducible reports, books, websites, and presentations that weave together prose, code, and output in a single document.
  • Audience: All USACE geospatial data scientists who produce technical documents or share analytical results.
  • URL: quarto.org

Shiny Server

  • Definition: An open source server platform for hosting Shiny web applications built in R.
  • Purpose: Self-host and share interactive Shiny applications on an internal or public web server without a commercial SaaS platform.
  • Audience: Data scientists who need to deploy Shiny apps on infrastructure they control.
  • URL: posit.co/products/open-source/shinyserver

Zotero

  • Definition: A free, open source reference management tool for collecting, organizing, citing, and sharing research sources.
  • Purpose: Manage bibliographic references and generate citations for technical reports and Quarto documents.
  • Audience: All USACE geospatial data scientists who cite methods, data sources, or standards in their work.
  • URL: zotero.org

14.8 Knowledge Representation

Protege Desktop

  • Definition: A free, open source ontology editor developed at Stanford University with full support for OWL 2 and description logic reasoners.
  • Purpose: Develop and maintain ontologies for knowledge representation, semantic data integration, and knowledge-based applications.
  • Audience: Data scientists and knowledge engineers working on semantic data modeling or Army Vantage ontology workflows.
  • URL: protege.stanford.edu | Install via app-portal.usace.army.mil