This is a new series looking at practical applications of Machine Learning from a Linux perspective. We only feature free and open source software in this series.
Let’s clear up one potential source of confusion at the outset. What’s the difference between Machine Learning and Deep Learning? The two terms mean different things.
In essence, Machine Learning is the practice of using algorithms to parse data, learn insights from that data, and then make a determination or prediction. The machine is ‘trained’ using huge amounts of data.
Deep Learning is a subset of Machine Learning that uses multi-layers artificial neural networks to deliver state-of-the-art accuracy in tasks such as object detection, speech recognition, language translation and others. Think of Machine Learning as cutting-edge, and Deep Learning as the cutting-edge of the cutting-edge.
Both Machine Learning and Deep Learning are changing the world. Deep Learning is trending.
We’ve written short reviews for each application. And there are many more reviews currently under preparation.
Graphics |
---|
CodeFormer is command-line software which offers blind face restoration. This aims at recovering high-quality faces from the low-quality counterparts suffering from unknown degradation, such as low-resolution, noise, blur, compression artifacts. This is freeware. |
Easy Diffusion is a web interface to Stable Diffusion designed to be as easy-to-use as possible. |
FBCNN (flexible blind convolutional neural network) is software which seeks to remove artifacts from JPEGs while preserving the integrity of the images. |
GFPGAN lets you perform real-world face restoration. This software can radically improve the quality of photos. |
InvokeAI is a Stable Diffusion toolkit. Generate highly detailed images based on text descriptions, or from images/drawings. |
Old Photo Restoration uses deep learning to restore old photos via deep latent space translation. |
Real-ESRGAN creates practical algorithms for general image/video restoration. |
Rembg is software to remove the backgrounds from images. The tool relies on the U2Net model, a machine learning model that performs object cropping in a single shot. |
Stable Diffusion web UI is a web interface to Stable Diffusion, a deep learning text-to-image diffusion model capable of generating photo-realistic images given any text input. |
Upscayl is GUI software that uses sophisticated AI models to enhance your images by guessing what the details could be. |
Audio |
---|
Demucs is billed as “a state-of-the-art music source separation model, currently capable of separating drums, bass, and vocals from the rest of the accompaniment”. |
Coqui STT is a deep-learning toolkit for training and deploying speech-to-text models. |
StemRoller is GUI software which lets you separate vocal and instrumental stems from any song with a single click. |
Ultimate Vocal Remover is a GUI that lets you isolate stems from music. It offers convenient access to a wide range of different models. |
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. |
Science |
---|
astroML is a Python module which offers statistical data analysis in astronomy and astrophysics. |
scikit-learn is a machine learning library built on top of SciPy that supports supervised and unsupervised learning. It also provides various tools for model fitting, data preprocessing, model selection, model evaluation, and many other utilities |
Chat |
---|
ChatGPT (by lencx) is a desktop application wrapper for the ChatGPT website. The chatbot generates human-like text in a conversational style and can be used for a variety of natural language processing tasks. |
If you would have recommendations for other good free and open source machine learning software for Linux, please comment below.
Machine learning science apps please!