Generative Machine Learning
Illustrated Stable Diffusion & How diffusion models work are great reads.
Progress in models like Midjourney is getting insanely good. Can even generate nice header images or logos.
Using Gauss (Mochi Diffusion is nice too) (built on Core ML Stable Diffusion) to generate prompts. ImaginAIry is useful too.
Trying to use more of these technologies in my day to life. Specifically ChatGPT seems incredible. Generates nice code too. Need to be wary though as they don't do much novelty yet.
Fast Stable Diffusion Colab Notebooks is great.
ChatGPT Resources & Why does ChatGPT work so well are nice reads to understand how ChatGPT works.
Want to understand DeepFloyd IF model.
Notes
- The net effect of the last 18 mo has been to slightly lubricate human imagination, making it easier for individuals to visualize possible worlds/scenes/moods. It’s only a medium-big deal—unless this is a foretaste of similar acceleration in music, code, and text.
- Greatest impact of "generative ML" will not be on art/creation, but by bringing technology leverage to billions of hours of boring data entry/manipulation jobs.
- Diffusion is just an easy-to-optimize way to give neural networks adaptive computation time. Makes sense then that diffusion models beat GANs, which only get one forward pass to generate an image. Have to wonder what other ways there are to integrate for loops into NNs.
- The fact that LLMs seem to give you more accurate results if you prompt them with "think step by step" is absolutely fascinating. An analogy might be intuitive vs logical thinking.
Links
- DALL·E: Introducing Outpainting (2022) (HN)
- Ask HN: Am I the only one tired of seeing DALL·E /Stable Diffusion posts? (2022)
- Stable Diffusion is a big deal (2022) (HN)
- Stable Diffusion Textual Inversion (HN)
- Stable Diffusion Public Release (2022) (HN)
- Peacasso - Web UI for Stable Diffusion Models. (Reddit)
- Optimized Stable Diffusion - Modified version of the Stable Diffusion repo, optimized to use less VRAM than the original by sacrificing inference speed.
- EvoGen - Evolutionary algorithm that optimizes prompts for text-to-image models for aesthetics.
- All about the fundamentals and working of Diffusion Models
- Experiments with Stable Diffusion (Tweet)
- DreamStudio - Front end and API to use the recently released stable diffusion image generation model.
- Stable Diffusion web UI (HN)
- koi - Open source plug-in for Krita that allows you to use AI to accelerate your art workflow.
- Awesome Stable-Diffusion
- Simple Stable Diffusion - Get stable diffusion running in <10 minutes in colab.
- Stable Diffusion Playground
- Create videos with Stable Diffusion - By exploring the latent space and morphing between text prompts.
- Long Stable Diffusion: Long-form text to images
- Infinite Stable Diffusion Videos (HN)
- Why 'weird patterns' arise in the latent space of an image generation models (Tweet)
- Textual Inversion fine-tuning example (Tweet)
- Progressive Distillation for Fast Sampling of Diffusion Models (2022)
- Stable Diffusion prompting cheat sheet
- The Man behind Stable Diffusion (2022)
- Inpainting - Web GUI for inpainting with Stable Diffusion using the Replicate API. (Code)
- Dreambooth on Stable Diffusion (Optimized Fork)
- Japanese Stable Diffusion
- Stable Diffusion web UI
- Stable DreamBooth - Implementation of DreamBooth based on Stable Diffusion.
- George Hotz | stable diffusion, in tinygrad (2022)
- CLIP-Mesh: Generating textured meshes from text using pretrained image-text models (2022) (Code)
- Prompt-to-Prompt Image Editing with Cross Attention Control (2022) (Code)
- Stable Diffusion REST API
- Visual Taste Approximator - Simple tool that helps anyone create an automatic replica of themselves that can approximate their own personal visual taste.
- Stable Diffusion concepts library (Tweet)
- Stable Diffusion for Apple Silicon
- Learn time series with a story illustrated by Stable Diffusion (2022) (HN)
- Daemon which watches a queue and runs stable diffusion
- Text-to-image for my inbox (2022)
- sdutils - Stable Diffusion Utility Wrapper.
- Diffusion Bee - Stable Diffusion GUI App for M1 Mac. (HN)
- Art Hub AI - Discover, upload and share AI generated art pieces..
- AI Content Generation, Part 1: Machine Learning Basics (Tweet)
- Inpainting with Stable Diffusion & Replicate
- Diffusion Models: A Comprehensive Survey of Methods and Applications
- stability-clients - Client implementations that interact with the Stability Generator API.
- Storyweaving with AI (Code)
- dreamlike.art - AI Art Generator.
- Stable Diffusion Photoshop Plugin
- Outpainting with Stable Diffusion on an infinite canvas
- Stable Diffusion: With Composition
- Swift Diffusion - Single-file re-implementation of Stable Diffusion model.
- From Deep Learning Foundations to Stable Diffusion (2022) (HN)
- Production software using OpenAI GPT-3 APIs
- CHARL-E - Run Stable Diffusion on your M1 Mac. (HN)
- Stable Diffusion in Tensorflow / Keras (Colab)
- Osmosis.Studio - Product Ad Creative and Optimization with Generative AI.
- Upscale to huge sizes and add detail with SD Upscale, it's easy!
- Open Prompts - Dataset of 10M Stable Diffusion generations. (HN)
- KREA - Create better prompts.
- GLID-3-XL-stable - Stable diffusion back-ported to the OpenAI guided diffusion codebase, for easier development and training.
- ImaginAIry - AI imagined images. Pythonic generation of stable diffusion images. (HN)
- UnstableFusion - Stable Diffusion desktop frontend with inpainting, img2img and more.
- Dreamfields-3D - Colab friendly toolkit to generate 3D mesh model / video / nerf instance / multiview images of colourful 3D objects by text and image prompts input, based on dreamfields.
- Fast Stable Diffusion Colab Notebooks - 25% speed increase + memory efficient. (Colab)
- High-performance image generation using Stable Diffusion in KerasCV (HN)
- Video Killed The Radio Star - Notebook and tools for end-to-end automation of music video production with generative AI.
- Custom scripts for the stable diffusion web UI
- Phenaki - Model for generating minutes-long, changing-prompt videos from text. (HN)
- GhostlyStock - Stock Photos Using Stable Diffusion. (HN)
- Prompt engineering is hard (2022)
- Notes and plans for Fast Diffusion course
- Stable Diffusion CPU only
- TabDDPM: Modelling Tabular Data with Diffusion Models (2022) (Code)
- Animation Script - Animator script for SD Web UI.
- Latent space walking: minimal Keras Colab (Tweet)
- The Illustrated Stable Diffusion (2022) (HN)
- DALL·E Node - Use DALL·E 2 with NodeJS.
- Novel View Synthesis with Diffusion Models (2022) (HN)
- Imagen Video - High definition video generation with diffusion models. (HN)
- Is the AI spell-casting metaphor harmful or helpful? (2022)
- Ask HN: What am I supposed to do after I’m “disrupted”? Work in video and CG (2022)
- Stable-Dreamfusion - PyTorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. (HN)
- Text2All - Comprehensive list of resources about text-guided generative models.
- HuggingFace Space and model of VToonify (2022) (Tweet)
- How AI Image Generators Work (Stable Diffusion / Dall-E) (2022)
- Manifest - How to make prompt programming with Foundation Models a little easier.
- Implementation of Dreambooth by way of Textual Inversion
- InvokeAI - Open source Stable Diffusion toolkit and WebUI. (HN)
- Astraea - Tailor-made AI image generation.
- Getting started with diffusion
- 14 awesome Stable Diffusion notebooks
- Understanding Diffusion Models: A Unified Perspective (2022) (Annotated)
- Maple Diffusion - Runs Stable Diffusion models locally on macOS / iOS devices, in Swift, using the MPSGraph framework.
- Asymmetric Tiling for stable-diffusion-webui
- Prompt-to-Prompt: Latent Diffusion and Stable Diffusion implementation (2022)
- Real-time inference for Stable Diffusion
- latentspace.dev - Exploring stable diffusion latent space. (Tweet)
- Photoshop for Text (2022) (HN)
- Artists: AI Image Generators Can Make Copycat Images in Seconds (2022)
- Stability.AI Easy Diffusion - Google Colab Notebook designed to be a relatively easy to use all-in-one suite for stable diffusion.
- Why we chose not to release Stable Diffusion 1.5 as quickly (2022) (HN)
- A Survey on Generative Diffusion Model (2022) (Code)
- Making a Video from Prompts with Stable Diffusion
- Maple Diffusion - Stable Diffusion inference on iOS / macOS using MPSGraph.
- VectorArt.ai - Vector Graphics with Stable Diffusion. (HN)
- Generative Image workflow in Runway
- How does stable diffusion work
- DiffusionDB - Large-scale text-to-image prompt gallery dataset based on Stable Diffusion.
- Implementation of a server for the Stability AI Stable Diffusion API
- Compositional Visual Generation with Composable Diffusion Models (2022) (Code)
- Avatar AI - Create your own AI-generated avatars.
- CLIP Interrogator
- Animation focused workflow frontend for Stable Diffusion
- Backend for my Stable diffusion projects
- Carefree Creator - AI-powered creator for everyone.
- Categorical SDEs with Simplex Diffusion (2022) (Tweet)
- Reaction-diffusion - Mathematical model describing how two chemicals might react to each other as they diffuse through a medium together.
- Banana Serverless - Basic framework for serving Stable Diffusion in production using simple HTTP servers.
- Sketch Diffusion – Live Painting with Stable Diffusion on Meta Quest Pro
- List of Stable Diffusion resources (HN)
- Invasive Diffusion: one unwilling illustrator found her turned into an AI model (2022) (HN)
- AI Horde - Crowdsourced distributed cluster for AI art and text generation.
- Distributed Diffusion - Train a Stable Diffusion model over the internet with Hivemind.
- Unprompted for Stable Diffusion - Text generator written for Stable Diffusion workflows.
- Rise of generative AI will be comparable to the rise of CGI in the early 90s (2022) (HN)
- DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps (2022) (Code)
- Ask CLI - Deno CLI for pinging GPT-3 and iterating with chain of thought prompting.
- Hugging Face Diffusion Models Course
- Ask HN: How to get into AI generation (images,text) (2022)
- Stable Diffusion and AI generated art is absolutely wild in every way (2022)
- diffusers-rs - Diffusers API in Rust/Torch.
- AI Art Tools and Resources in One Place (HN)
- Stretch iPhone to its limit: 2GiB Stable Diffusion model runs locally on device (2022) (HN)
- Generative AI: A Creative New World (2022)
- Stable Diffusion with Colossal-AI
- DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models (2022) (Code)
- Stable-Diffusion + Fused CUDA kernels
- Dall-E 2 AI Image Generator - Using Upstash for message queue + Redis. (Code)
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model (2022) (Code)
- How diffusion models work
- Implementation of Paint-with-words with Stable Diffusion
- Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding (2022) (Code) (HN)
- PALBERT: Teaching ALBERT to Ponder (2022) (Code)
- DiffusionDet: Diffusion Model for Object Detection (2022) (Code)
- Some notes on the Stable Diffusion safety filter (2022) (HN)
- Lightning Diffusion - Provides components to finetune and serve diffusion model on lightning.ai.
- Deforum Stable Diffusion
- Deforum - Community of AI image synthesis developers, enthusiasts, and artists. (GitHub)
- Shift-Attention - In stable diffusion, generate a sequence of images shifting attention in the prompt.
- London, 1910 by Midjourney
- GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation (2022) (Code)
- Magic3D: High-Resolution Text-to-3D Content Creation (2022) (HN)
- Stable Diffusion with Nix - Quickly get up and running using Stable Diffusion with Nix flakes.
- Minimal text diffusion - Minimal implementation of diffusion models for text generation.
- VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models (2022)
- RAD-NeRF: Real-time Neural Talking Portrait Synthesis
- OpenAI Cookbook - Examples and guides for using the OpenAI API.
- Dispict - Design a growing artistic exhibit of your own making, with semantic search powered by OpenAI CLIP.
- Stable Diffusion 2.0 (2022) (HN) (Reddit) (Code)
- Generate photo-realistic images from text using Stable Diffusion
- Upscayl - AI Image Upscaler.
- Kandinsky 2.0 - Multilingual text2image latent diffusion model.
- Dreambooth Extension for Stable-Diffusion-WebUI
- Dream Bench - Tool for benchmarking image generation models.
- Math of diffusion (2022)
- AI Art Panic (2022)
- Some notes on the Stable Diffusion safety filter (2022)
- Stable Diffusion 2.0 and the Importance of Negative Prompts for Good Results (2022) (HN)
- Ten Years of Image Synthesis (2022) (HN)
- Diffusion Models Live Event (2022)
- Core ML Stable Diffusion - Run Stable Diffusion on Apple Silicon with Core ML. (Stable Diffusion with Core ML on Apple Silicon) (HN)
- Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation (2022) (Code)
- Generative AI: autocomplete for everything (2022) (Tweet)
- Basic Dreambooth Guide - Short guide on the process of collecting a dataset and basic dreambooth settings.
- People tricking ChatGPT “like watching an Asimov novel come to life” (HN)
- ChatGPT Assistant - Extension that enhances your browsing experience on the ChatGPT website, with features like page fetching and more.
- ChatGPT for VSCode - VSCode extension that allows you to use ChatGPT.
- Placing #1 in Advent of Code with GPT-3 (HN)
- ChatGPT API - Node.js wrapper around ChatGPT. Uses headless Chrome until the official API is released.
- ChatGPT for Google - Chrome extension to show ChatGPT response in Google search results. (HN)
- A new AI game: Give me ideas for crimes to do (2022)
- Ask HN: How would you build a ChatGPT detector? (2022)
- GPT Index - Index created by GPT to organize external information and answer queries. (Use Cases) (Docs)
- ChatGPT export to PNG / PDF / HTML - Chrome extension for downloading your ChatGPT history to PNG, PDF or creating a sharable link.
- Stable Diffusion v 2.0 web UI
- GPT-3 Prompter - Use OpenAI's GPT-3 API prompter on any website.
- iOS app that generates images using Stable Diffusion v2
- ChatGPT passes the 2022 AP Computer Science A free response section (HN)
- ChatGPT can reply like a specific Reddit or HN user, including you
- Using ChatGPT as a Co-Founder (2022) (HN)
- Discuss HN: Software Careers Post ChatGPT+ (2022)
- ChatGPT vs. a Cryptic Crossword (2022) (HN)
- ShareGPT - Share your wildest ChatGPT conversations with one click. (Tweet)
- Awesome ChatGPT
- I Taught ChatGPT to Invent a Language (2022) (HN)
- Ask HN: What's in for ChatGPT, Stable Diffusion, etc. after dust settles? (2022)
- Awesome ChatGPT Prompts
- The Human's Guide to Competing with GPT (2022)
- Talk = GPT-2 and Whisper and WASM (HN)
- Ask HN: How do you cope with existential threat regarding career? (2022)
- ChatGPT Raycast extension
- Prompts for games and world building in ChatGPT (2022)
- showGPT - Guide to unlocking the power of AI and chatGPT.
- How to use ChatGPT as founder
- PyChatGPT - Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.
- Pair Programming with AI: Writing a Distributed, Fault-Tolerant Redis Client using ChatGPT (2022)
- Bumblebee - GPT2, Stable Diffusion, and More in Elixir. (HN) (Article)
- ChatGPT, Rot13, and Daniel Kahneman (2022) (HN)
- Why does ChatGPT work so well?
- Ask HN: Can ChatGPT generate fully functional code? (2022)
- Stable Diffusion 2.0 Negative Prompting
- GPT-2 Output Detector
- ChatGPT Advanced - Browser extension that augments your ChatGPT prompts with web results.
- Diffusion Chat - Chat-like interface for Stable Diffusion.
- StackExplain - Explain your error message in plain English using ChatGPT.
- I am frustrated with Stable Diffusion (2022) (HN)
- Make ChatGPT yourself
- ChatGPT Mac Menu Bar - Chat with OpenAI's ChatGPT in mac menu bar.
- Stableboost - Create personalized images with AI.
- ChatGPT can create cool images using TikZ (2022)
- Stable Diffusion in Docker
- OpenAI (ChatGPT) API Client for Go
- Summarize - Summarize web pages using OpenAI ChatGPT.
- Ask HN: How does ChatGPT work? (2022)
- Disputing a Parking Fine with ChatGPT (2022) (HN)
- GPT3 Writer in code
- ChatGPT Extension - Access OpenAI's ChatGPT from anywhere on the web.
- Ask HN: Is the weaponisation of ChatGPT now inevitable? (2022)
- ChatGPT Resources
- Photoshot - AI Avatar generator. (Code)
- Lexica Aperture - Generate realistic looking photographs. (Tweet)
- Ask HN: Self-hosted/open-source ChatGPT alternative? Like Stable Diffusion (2022)
- GPT-3 Visual Studio Code Extension - Use GPT-3 to generate documentation and get help debugging your code.
- Swift app demonstrating Core ML Stable Diffusion
- StructuredDiffusion: Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis (2022) (Code)
- All the ways to get around ChatGPT's safeguards (HN)
- How does GPT obtain its ability? Tracing emergent abilities of language models (2022) (HN)
- How Might Generative AI Change Programming? (2022)
- Stable Tuner - Finetuning SD in style.
- Prompt Engineering Guide (HN)
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (Web)
- Learn Prompting (Code)
- GPT‑3/LLM prompts are assembly code, not a human interface
- Mochi Diffusion - Run Stable Diffusion on Apple Silicon Macs natively.
- Scalable Diffusion Models with Transformers (2022) (Code)
- Stable Diffusion 2 Depth Guided model: architecture photos from dollhouse (2022) (HN)
- Stable Diffusion on AMD RDNA3 (2022) (HN)
- ChatGPT CLI
- Point-E: A System for Generating 3D Point Clouds from Complex Prompts (2022) (Code)
- Go GPT3 - OpenAI GPT-3 API client enabling Go programs to interact with the GPT3 APIs.
- DreamBooth - Cog model that takes training images as input and generates custom Stable Diffusion model weights as output.
- Gauss - Stable Diffusion macOS native app. (HN)
- AI Art Generator - For automating the creation of large batches of AI-generated artwork locally.
- Run Stable Diffusion natively on your Mac (HN)
- Various ways of serving Stable Diffusion using Keras
- Fine-tuning Stable Diffusion using Keras
- Stable Diffusion v2 Cog model
- Karlo - Text-conditional image generation model based on OpenAI's unCLIP architecture.
- High Resolution Depth Maps for Stable Diffusion WebUI
- Prompt Extend - Text generation model for generating suitable style cues given the main idea for a prompt.
- How diffusion models work
- My Midjourney AI Art (Code)
- Flake for running SD on NixOS
- Diffusion Models in Vision: A Survey (2022) (Code)
- Denoising Diffusion Implicit Models (DDIM)
- Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models (2022) (Code)
- PromptToImage - Stable Diffusion app for macOS based on CoreML models.
- Stable Diffusion Deploy - Learn to serve Stable Diffusion models on cloud infrastructure at scale.
- Muse: Text-To-Image Generation via Masked Generative Transformers (2023) (Web) (Tweet) (Code)
- How the physics of diffusion inspired modern AI art (2023)
- Denoising Diffusion models from first principle in Julia (2022) (HN)
- Guided denoising diffusion (2023)
- Personalizing Text-to-Image Generation via Aesthetic Gradients (2022) (Code)
- Simple tools for using open source text-to-image models
- Prompt Tool - Open-source tool that makes it easy for people to explore styles, and complex MidJourney prompts, visually. (Web)
- Awesome Generative AI (HN)
- Latent blending - Video transitions with incredible smoothness between prompts, computed within seconds.
- SD LEAP Booster - Fast fine tuning using a booster model that puts the initial state to a local minimum.
- Stable Karlo - Upscaling Karlo text-to-image generation using Stable Diffusion v2.
- StoryTeller - Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech.
- Diffusion for beginners
- Neural Frames - Create your own video clips with Stable Diffusion.
- Collection of generative AI applications
- Sketch - AI code-writing assistant that understands data content.
- Prompt Templates for Stable Diffusion
- Stable Diffusion in Code (AI Image Generation) - Computerphile (2022)
- Imitating Human Behaviour with Diffusion Models (2023)
- Mann-E - OpenJourney: Midjourney, but Open Source. (HN)
- Docker Diffusers API - Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.
- Stable Diffusion Accelerated
- Paint by Text - Edit your photos using written instructions, with the help of an AI. (Code)
- Stable Target Field for Reduced Variance Score Estimation (2023) (Code)
- Lsmith - StableDiffusionWebUI accelerated using TensorRT.
- Diffusion Models already have a Semantic Latent Space (2023) (Code)
- Stable Attribution (HN)
- Gen-1 by Runway (HN)
- Dino Diffusion: Bare-bones diffusion model code
- Mixture of diffusers for location-aware image generation
- Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation (2022) (Code)
- ControlNet - Let us control diffusion models.
- CLIP Guided Diffusion - CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
- TileMaker - Create seamless tiled images with material diffusion. (HN) (Code)
- Illusion Diffusion - Optical illusions using stable diffusion. (HN)
- Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models (Code)
- MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation (2023) (Code)
- Diffusion WebUI Colab - Choose your diffusion models and spin up a WebUI on Colab in one click.
- Universal Guidance for Diffusion Models (2023) (Code)
- Scribble Diffusion - Turn your sketch into a refined image using AI. (Code) (HN)
- tldream - Tiny little diffusion drawing app.
- Civitai - Stable Diffusion models, embeddings, hypernetworks and more. (Code)
- Easy Lora Handbook - Most easy-to-understand tutorial for using Lora within diffusers framework for AI Generation Researchers.
- On the Mathematics of Diffusion Models (2023)
- ComfyUI - Powerful and modular stable diffusion GUI.
- WebUI extension for ControlNet
- Stable Diffusion WebUI Colabs With ControlNet
- ControlLoRA: A Light Neural Network To Control Stable Diffusion Spatial Information (2023)
- generate potter - Boilerplate repo to deploy a custom stable diffusion model.
- ControlNet in Diffusers (2023)
- Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent (2023) (Code)
- One Transformer Fits All Distributions in Multi-Modal Diffusion (2022) (Code)
- Breadboard - Stable Diffusion Browser.
- Stable Diffusion WebUI 3D Model Loader
- Erasing Concepts from Diffusion Models (2023) (Code)
- Editing Implicit Assumptions in Text-to-Image Diffusion Models (2023) (Code)
- Web Stable Diffusion - Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
- Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models (2023) (Code)
- P+: Extended Textual Conditioning in Text-to-Image Generation (2023) (Code)
- ArtBot for Stable Diffusion - Front end GUI for interacting with the Stable Horde / Stable Diffusion distributed cluster.
- Adobe Firefly - AI Art Generator. (Explained) (HN)
- Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
- ReVersion: Diffusion-Based Relation Inversion from Images (2023) (Code)
- Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior (2023) (Code)
- Stable diffusion webui colab
- Token Merging for Stable Diffusion - Using nothing but pure python and pytorch, ToMe for SD speeds up diffusion by merging redundant tokens.
- DiffusionFastForward - Course on diffusion generative models in a fast forward mode.
- Deep Learning Foundations to Stable Diffusion (2023) (HN)
- Grounding DINO - Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
- Harry Potter By Balenciaga | Step by Step Tutorial
- HCP-Diffusion - Universal Stable-Diffusion toolbox.
- Open-MUSE - Open reproduction of MUSE for fast text2image generation.
- TemporalKit - Extension for Automatic1111 to add temporal consistency to your renders.
- How I Used Stable Diffusion and Dreambooth to Create A Painted Portrait of My Dog (2023)
- Stable Diffusion - Automatic - Opinionated fork/implementation of Stable Diffusion.
- Stable Diffusion RPC - gRPC server for a Stable Diffusion worker on Apple Platforms.
- ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation (2023) (Code)
- Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model
- What I learned about fine-tuning stable diffusion
- bulkai - Tool to generate AI images in bulk.
- Jupyter AI - Generative AI extension for JupyterLab.
- DeepFloyd IF by DeepFloyd, StabilityAI (HN)
- Scaling up GANs for Text-to-Image Synthesis (2023) (Code)
- FaceLit: Neural 3D Relightable Faces (2023) (Code)
- Training Stable Diffusion from Scratch for <$50k with MosaicML (2023)
- Reflected Diffusion Models (2023) (HN)
- StableSR - Exploiting Diffusion Prior for Real-World Image Super-Resolution.
- StableStudio by Stability AI (HN)
- Quivr - Dump all your files and thoughts into your GenerativeAI Second Brain and chat with it.
- Stable Diffusion Training with MosaicML
- Google Generative AI Python Client
- Diff-Pruning: Structural Pruning for Diffusion Models
- StableSR for Stable Diffusion WebUI
- Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation (2023) (Code)
- After Detailer - Auto detecting, masking and inpainting with detection model.
- One-click deepfake (face swap)
- Examples using Photoshop’s new “Generative Fill” feature (HN)
- Protein Design with Guided Discrete Diffusion
- StyleDrop: Text-to-Image Generation in Any Style (2023) (HN) (Paper) (Code)
- Segment Anything in High Quality (2023) (Code)
- Fictiverse - Real time Diffusion, Using Automatic1111 Stable Diffusion API. (HN)
- Generative AI learning path (HN)
- C++ Implementation of StableDiffusion (HN)
- Working anime QR codes using Stable Diffusion
- Stable Diffusion Cheat-Sheet - List of StableDiffusion styles and some notes for offline use.
- How to make a QR code with Stable Diffusion
- Video to video with Stable Diffusion (2023) (HN)
- Emergent Correspondence from Image Diffusion (2023) (Code)
- Stable Diffusion powered level editor for a 2D game (HN)
- SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds (HN)
- Ealain for macOS - Screensaver that generates abstract art using Stable Diffusion.
- Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond (Code)
- Comparing Adobe Firefly, Dalle-2, and OpenJourney (2023) (HN)
- Kitchen Theme for Stable Diffusion WebUI
- Sample code and notebooks for Generative AI on Google Cloud
- Generative Models by Stability AI
- StyleDrop: Text-to-Image Generation in Any Style in PyTorch
- AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning (2023) (Code)
- DiffUTE: Universal Text Editing Diffusion Model
- Tiny Stable Diffusion - Optimized Stable-diffusion that can run on GPUs with just 1GB of VRAM.
- Stable Diffusion WebGPU demo (HN)
- DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models (2023) (Code)
- OnnxStream - Running Stable Diffusion in 260MB of RAM. (HN)
- Diffusers - Modular Rust library for super fast Stable Diffusion inference - 45% faster than PyTorch.
- Unifying Flow, Stereo and Depth Estimation (2023) (Code)
- FABRIC: Personalizing Diffusion Models with Iterative Feedback (2023) (Code)
- Awesome Diffusion Personalization
- Stable Diffusion XL Video - Train a Video Diffusion Model Using Stable Diffusion XL Image Priors.
- distill-sd - 50% Smaller, Faster Stable Diffusion.
- Color-Diffusion - Using diffusion models to colorize black and white images. (HN)
- Stable-Diffusion-Burn - Stable Diffusion v1.4 ported to Rust's burn framework.
- Expert-Level Tutorials on Stable Diffusion & SDXL: Master Advanced Techniques and Strategies
- Tiny AutoEncoder for Stable Diffusion
- Stable Diffusion XL training and inference as a cog model
- StableCode (2023) (HN)
- Fooocus - Image generating software.
- Sandbox - Web app for exploring generative AI models.
- Text-To-Video-Finetuning - Finetune ModelScope's Text To Video model using Diffusers.
- Mirage3D - Open-Source Implementations of 3D Diffusion Models Optimized for GLB Output.
- EveryDream Trainer 2.0
- Stable Diffusion WebUI Docker
- Fine-tune SDXL with Replicate (2023)
- Opendream - Extensible, easy-to-use, and portable diffusion web UI. (HN)
- Huggingface Diffusers Compatible SDXL Unet Rewrite
- Stable Diffusion in pure C/C++ (HN)
- I Made Stable Diffusion XL Smarter by Finetuning It on Bad AI-Generated Images (2023) (HN)
- Image Inpainting for SDXL 1.0 Base Model + Refiner (2023)
- OneDiffusion - Run any Stable Diffusion models and fine-tuned weights with ease. (HN)
- Image Matching WebUI
- BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion (2023)
- Stable-Diffusion-XL-Burn - Stable Diffusion XL ported to Rust's burn framework. (Reddit)
- ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
- LDMVFI: Video Frame Interpolation with Latent Diffusion Models (2023)
- LECO - Low-rank adaptation for Erasing COncepts from diffusion models.
- Modular Diffusion - Python library for designing and training your own Diffusion Models with PyTorch.
- Tiny Dream - Embedded, Header Only, Stable Diffusion C++ implementation.
- Using SDXL's Revision workflow with and without prompts
- Unstable Journey - Desktop Paint application powered by Stable Diffusion, automatic1111 webui and PieCasso.
- StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator in PyTorch
- InstructDiffusion: A Generalist Modeling Interface for Vision Tasks