Tag: open-source
Filtered selection of tools tagged open-source.
Apache Cassandra
Apache Cassandra is an open-source distributed NoSQL database for highly available, horizontally scalable workloads across many nodes.
Hermes Agent
Hermes Agent is an open-source agent from Nous Research, designed as a long-running personal work agent with memory, skills, tools, and messaging integrations.
Inkscape
Inkscape is an open-source vector graphics editor for logos, icons, diagrams, and scalable illustrations.
Marian NMT
Open-source framework for neural machine translation and technical NMT workflows.
Quasar Framework
Quasar Framework is an open-source framework for building web and mobile applications. It enables developers to create responsive, cross-platform apps from a single codebase. Quasar is built on Vue.js and offers an extensive collection of UI components optimized for fast development and high performance.
Adapt Learning
Adapt Learning is an open-source authoring platform for creating responsive e-learning content. It supports modular course creation, adaptive learning, and standards such as SCORM and xAPI, making it suitable for teams that want flexible, customizable learning experiences.
Airbyte
Airbyte is an open-source data integration platform that enables developers and businesses to extract, transform, and load (ETL) data from various sources into target systems. Supporting a wide range of data sources and destinations, it stands out for its high customizability and ease of extensibility. Airbyte offers both a free open-source version and paid plans with extended features.
Apache Airflow
Apache Airflow is useful when workflow orchestration for data pipelines needs to be managed as code with clear DAGs, dependencies, retries, and operational control. It is especially relevant for data engineering teams with many scheduled jobs, but it can create too much overhead for small standalone scripts.
Apache Beam
Apache Beam is a powerful open-source framework for unified development of data processing pipelines. It enables developers to create both batch and streaming data processing within a single model that can run on various execution environments. Apache Beam supports multiple programming languages and integrates flexibly with different backend engines such as Apache Flink, Apache Spark, or Google Cloud Dataflow.
Apache Druid
Apache Druid is an open-source analytics database designed for real-time analysis of large data volumes. It combines fast ingestion, low-latency queries, and high scalability, enabling companies and developers to perform complex data analysis in real time. Druid is commonly used for business intelligence, monitoring, and ad hoc analysis.
Apache Hadoop (self-hosted)
Apache Hadoop is an open-source framework for distributed storage and processing of large data sets. It enables companies and developers to store and analyze large amounts of data in clusters of commodity servers with high scalability. The self-hosted variant offers full control over infrastructure and data, which is particularly attractive for companies with high data protection requirements or special adaptation needs.
Apache Kafka
Apache Kafka is an open-source platform for distributed real-time data streaming. It enables organizations to reliably capture, process, and analyze large volumes of streaming data. Kafka is commonly used for event streaming, data integration, and building modern data-driven applications.
Apache NiFi
Apache NiFi is an open-source tool for visual dataflow automation, covering ingestion, routing, transformation, and system integration.
Apache Pinot
Apache Pinot is a distributed open-source analytics database built for real-time analysis of large-scale data. It helps teams run low-latency complex queries on streaming and batch data, making it a strong fit for data-intensive applications that need fast insights and high scalability.
Apache Pulsar
Apache Pulsar is a scalable open-source platform for distributed messaging and streaming with multi-tenancy, geo-replication, and low-latency data processing.
Auto-sklearn
Auto-sklearn is an open-source automated machine learning (AutoML) toolkit that enables developers and data scientists to create models efficiently without requiring deep knowledge of model optimization. By combining meta-learning and Bayesian optimization, Auto-sklearn automates the selection and tuning of algorithms, reducing development time and improving model quality.
BibDesk
BibDesk is a macOS reference manager built around BibTeX workflows, suited to people who want to manage academic sources locally with precise control over metadata, citation keys, PDFs, and LaTeX-friendly bibliographies.
BigBlueButton
BigBlueButton is an open-source web conferencing tool for education and training, especially useful for schools, universities, and organizations that want self-hosting, classroom workflows, breakout rooms, recording, and moderation without relying on a proprietary all-in-one platform.
Blockly
Blockly is a browser-based open-source library that enables the creation of graphical programming environments. Users can generate functional code by visually assembling code blocks without in-depth programming knowledge. It supports multiple programming languages, including JavaScript, Python, and others, and is commonly used in education and prototype development.
Caffe
Caffe is a well-known open-source framework for machine learning, particularly suited for the development and training of deep neural networks. Originally developed at the University of California, Berkeley, Caffe offers an efficient and flexible platform that researchers and developers use to create and implement complex AI models. The framework is characterized by its speed and user-friendliness and supports various applications in image and video processing.
Curl
Curl is a versatile command-line tool primarily used for transferring data with URL syntax. It supports a wide range of protocols such as HTTP, HTTPS, FTP, and many more. As open-source software, Curl is popular worldwide among developers, system administrators, and IT professionals who seek simple and efficient methods to send and receive data over the internet.
DeepFaceLab
DeepFaceLab is open-source software for creating deepfake videos. The application allows users to swap or manipulate faces in videos using artificial intelligence. It is particularly useful in the fields of research, media production, and creative projects. The software offers a range of tools for face reconstruction, training neural networks, and precise video editing.
DuckDB
DuckDB is a lightweight, embedded relational database designed specifically for analytical workloads. It enables fast SQL queries directly within local applications or scripts without the need to run a separate database server. As an open-source project, DuckDB provides developers with a flexible and high-performance solution for data analysis that integrates seamlessly with many programming languages and development environments.
Fastai
Fastai is a powerful open-source library for machine learning that is based on Python and makes it easier to get started with deep learning and other machine learning methods. Developed with the goal of making complex models more accessible and faster to train, Fastai provides an intuitive API that helps both beginners and experienced developers create efficient AI applications. The library builds on PyTorch and combines advanced techniques with practical tutorials and courses that promote learning and applying AI technologies.
freeCodeCamp
freeCodeCamp is a free, open-source learning platform for building programming and web development skills through interactive lessons, projects, and certifications.
H5P
H5P is an open-source framework for creating interactive learning content such as quizzes, presentations, videos, and exercises.
Ionic Framework
Ionic Framework is an open-source toolkit for building cross-platform mobile and web applications. It enables developers to create native-like apps for iOS, Android, and the web using familiar web technologies such as HTML, CSS, and JavaScript. With an extensive collection of UI components and powerful development tools, Ionic Framework supports fast and efficient development of modern applications.
JSBin
JSBin is an open-source, browser-based tool for writing, testing, and sharing HTML, CSS, and JavaScript in real time.
LibreOffice Calc
LibreOffice Calc is a free spreadsheet application that is part of the open-source LibreOffice suite. It offers extensive features for data analysis, spreadsheet management, and visualization, suitable for both private users and professionals. As an alternative to commercial office programs, Calc is especially appealing due to its openness and adaptability.
Metabase
Metabase is an open-source business intelligence platform that enables companies to analyze data easily and present it in interactive dashboards. The software is designed for users without deep programming knowledge and offers an intuitive interface that lets data queries be created and visualized quickly. As a versatile tool, Metabase supports a range of data sources and is especially well suited for teams that want to make data-driven decisions.
MXNet
MXNet is a flexible and efficient open-source machine learning framework that is especially well suited for developing and training deep neural networks. It supports multiple programming languages and offers a scalable architecture that can be used on both individual devices and distributed environments. MXNet is known for its performance and flexibility, making it a popular choice for developers in the field of artificial intelligence.
NATS
NATS is an open-source messaging system for cloud-native applications, microservices, event streams, and distributed systems.
OCRmyPDF
OCRmyPDF adds a searchable text layer to scanned PDFs and is especially useful as a clean preprocessing step in local document pipelines.
Onsen UI
An open-source framework for building cross-platform mobile apps with a native look and feel, using HTML5, CSS, and JavaScript.
OpenFaaS
OpenFaaS is an open-source platform that allows developers to easily create, deploy, and manage serverless functions. Focusing on containerization and cloud integration, OpenFaaS provides a flexible environment for running microservices and functions independently of the underlying infrastructure. The platform supports multiple programming languages and can be used both locally and in the cloud.
OpenNMT
OpenNMT is a powerful open-source platform for neural machine translation (NMT). Designed to provide flexible and efficient translation solutions, OpenNMT enables businesses, researchers, and developers to train and deploy custom translation models. The platform supports various programming languages and frameworks and is used worldwide in a wide range of applications.
PaddleOCR
PaddleOCR is an open-source OCR toolkit for developers who want more control over recognition, layout analysis, and custom document pipelines.
PostgreSQL
PostgreSQL is a powerful, object-relational database management system (ORDBMS) renowned for its stability, flexibility, and extensibility. As open-source software, it provides developers and businesses with a robust platform for managing relational data with SQL support and a wide range of advanced features. PostgreSQL is suitable for projects of all sizes, from small applications to complex systems handling large volumes of data.
Python
Python is a versatile, interpreted programming language known for its simple syntax and high readability. As an open-source project, it is used worldwide by developers across a wide range of applications — from web development and data analysis to artificial intelligence and scientific computing. Its extensive standard library and large community make Python one of the most popular tools for programmers of all skill levels.
RabbitMQ
A reliable open-source message broker for asynchronous communication, distributed systems, and microservice architectures.
RawTherapee
RawTherapee is open-source software for editing RAW images. It is designed for photographers and image editors seeking extensive tools to develop and optimize raw data from digital cameras. With a wide range of features, RawTherapee enables detailed and precise image editing, ranging from exposure correction to color enhancement. The software is cross-platform and supports numerous camera models.
Samza
Apache Samza is an open-source framework for real-time stream processing. It is designed for developers, data engineers, and organizations that need scalable, fault-tolerant applications for continuously arriving data, with strong support for Kafka and distributed deployment environments.
Simplenote
Simplenote is a lightweight, streamlined note-taking app focused on the essentials: quickly capturing and managing notes. As an open-source tool, it offers a simple user interface without unnecessary features. The app syncs all notes across multiple devices and supports Markdown formatting, making it suitable for both personal and professional use.
Snorkel
Snorkel is an open-source platform for automated data labeling and data preparation for machine learning. It enables companies and researchers to efficiently annotate large amounts of unstructured data with less manual effort. By combining programmatic labeling methods and machine learning techniques, Snorkel supports the rapid development of training datasets for AI models.
TensorFlow / Keras
TensorFlow and Keras are open-source tools for building and training machine learning and deep learning models, with broad support for research, education, and production use.
Tesseract OCR
Tesseract OCR is an open-source OCR engine for local text recognition and remains an important building block when privacy, control, or cost argue against cloud OCR.
Typesense
Typesense is a modern open-source search engine for developers who want fast, relevant, and easy full-text search in their applications. It combines low latency, typo-tolerant search, faceting, multilingual support, and a simple API, making it a practical alternative to more complex search solutions.
VS Code Dev Containers
VS Code Dev Containers is an open-source extension for Visual Studio Code that allows developers to define and use development environments within Docker containers. These containers provide a consistent and isolated environment, simplifying project setup and management while enabling platform-independent reproducible development conditions. Especially useful in teams and complex projects, VS Code Dev Containers facilitates faster onboarding and reduces misconfigurations.
Waifu2x
An open-source AI image upscaler and denoiser originally built for anime, now also used for photos and other graphics.
XGBoost
XGBoost is an open-source machine-learning library for gradient boosting, suited to tabular data and robust predictive models.
Zeppelin
Zeppelin fits workflows where notebook-based data analysis with multiple interpreter backends is a regular part of the job. It is especially useful for teams that want to work collaboratively on exploratory Spark- and SQL-adjacent analysis in a structured notebook environment.