AI Toolbox
A curated collection of 759 free, cutting-edge AI papers with code and tools for text, image, video, 3D, and audio generation and manipulation.

AniTalker is another talking-head generator that can animate a face from a single portrait and input audio, with naturally flowing movements and diverse outcomes.
Magic Clothing can generate customized characters wearing specific garments from diverse text prompts, preserving the details of the target garments while maintaining faithfulness to the text prompts.
Audio-Synchronized Visual Animation can animate static images using audio clips to create synchronized visual animations. It uses the AVSync15 dataset and the AVSyncD diffusion model to produce high-quality animations across different audio types.
ClickDiff can generate controllable grasps for 3D objects. It employs a Dual Generation Framework to produce realistic grasps based on user-specified or algorithmically predicted contact maps.
ViPer can personalize image generation by capturing individual user preferences through a one-time commenting process on a selection of images. It utilizes these preferences to guide a text-to-image model, resulting in generated images that align closely with users’ visual tastes.
Adobe’s Magic Fixup lets you edit images with a cut-and-paste approach that fixes edits automatically. Can see this being super useful for generating animation frames for tools like AnimateDiff. But it’s not clear yet if or when this hits Photoshop.
SV4D can generate dynamic 3D content from a single video. It ensures that the new views are consistent across multiple frames and achieves high-quality results in video synthesis.
Artist stylizes images based on text prompts, preserving the original content while producing high aesthetic quality results. No finetuning, no ControlNets, it just works with your pretrained StableDiffusion model.
DreamCar can reconstruct 3D car models from just a few images or single-image inputs. It uses Score Distillation Sampling and pose optimization to enhance texture alignment and overall model quality, significantly outperforming existing methods.
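Score Distillation Sampling, which DreamCar builds on, optimizes scene parameters by noising a rendered image, asking a pretrained diffusion model to predict that noise, and using the prediction error as a gradient. Here is a minimal toy sketch of that update rule in NumPy; the `denoise` function and the identity "renderer" are stand-ins for illustration, not DreamCar's actual model:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "3D parameters": here just a flat 8x8 image that the
# (identity) renderer returns directly.
theta = rng.normal(size=(8, 8))
target = np.ones((8, 8))  # what the stand-in denoiser believes images look like

def denoise(x_noisy, t):
    """Stand-in for a pretrained diffusion model's noise prediction.
    It predicts whatever noise would move x_noisy away from `target`
    (hypothetical, purely for illustration)."""
    return x_noisy - np.sqrt(1.0 - t) * target

lr = 0.1
for step in range(200):
    t = rng.uniform(0.02, 0.98)           # random diffusion timestep
    eps = rng.normal(size=theta.shape)    # fresh Gaussian noise
    x_noisy = np.sqrt(1.0 - t) * theta + np.sqrt(t) * eps
    eps_hat = denoise(x_noisy, t)
    # SDS gradient: (predicted noise - true noise), backpropagated
    # through the renderer -- identity here, so it hits theta directly.
    grad = eps_hat - eps
    theta -= lr * grad

# After the loop, theta has drifted toward the denoiser's target.
```

In a real pipeline `theta` would be NeRF or mesh parameters and the gradient would flow through a differentiable renderer; the loop structure is the same.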
Cinemo can generate consistent and controllable image animations from static images. It achieves enhanced temporal consistency and smoothness through strategies like learning motion residuals and employing noise refinement techniques, allowing for precise user control over motion intensity.
MasterWeaver can generate photo-realistic images from a single reference image while keeping the person’s identity and allowing for easy edits. It uses an encoder to capture identity features and a unique editing direction loss to improve text control, enabling changes to clothing, accessories, and facial features.
UniTalker can create 3D face animations from speech input! It works better than other tools, making fewer mistakes in lip movements and performing well even with new data it hasn’t seen before.
Shape of Motion can reconstruct 3D scenes from a single video. The method is able to capture the full 3D motion of a scene and can handle occlusions and disocclusions.
MusiConGen can generate music tracks with precise control over rhythm and chords. It allows users to define musical features through symbolic chord sequences, BPM, and text prompts.
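Conditioning a generator on symbolic chords and BPM requires expanding the chord sequence into frame-aligned labels matching the model's frame rate. This hypothetical helper (not MusiConGen's actual API) shows the arithmetic, assuming each chord is held for a fixed number of beats:

```python
def chords_to_frames(chords, bpm, fps=50, beats_per_chord=4):
    """Expand a symbolic chord sequence like ['C', 'Am', 'F', 'G']
    into one label per model frame (hypothetical helper)."""
    seconds_per_chord = beats_per_chord * 60.0 / bpm
    frames_per_chord = round(fps * seconds_per_chord)
    labels = []
    for chord in chords:
        labels.extend([chord] * frames_per_chord)
    return labels

# At 120 BPM, 4 beats last 2 seconds, i.e. 100 frames per chord at 50 fps.
labels = chords_to_frames(["C", "Am", "F", "G"], bpm=120)
```

The frame rate and beats-per-chord values here are illustrative defaults; a real model would use whatever frame rate its conditioning stream expects.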
IMAGDressing-v1 can generate human try-on images from input garments. It is able to control different scenes through text and can be combined with IP-Adapter and ControlNet pose to enhance the diversity and controllability of generated images.
SparseCtrl is an image-to-video method with some cool new capabilities. With its RGB, depth, and sketch encoders and one or a few input images, it can animate images, interpolate between keyframes, extend videos, and guide video generation with only depth maps or a few sketches. Especially in love with how the scene transitions look.
3DWire can generate 3D house wireframes from text! The wireframes can be easily segmented into distinct components, such as walls, roofs, and rooms, reflecting the semantic essence of the shape.
An Object is Worth 64x64 Pixels can generate 3D models from 64x64 pixel images! It creates realistic objects with good shapes and colors, working as well as more complex methods.
AccDiffusion can generate high-resolution images with fewer object repetitions! Something Stable Diffusion has been plagued by since its infancy.
Noise Calibration can improve video quality while keeping the original content structure. It uses a noise optimization strategy with pre-trained diffusion models to enhance visuals and ensure consistency between original and enhanced videos.
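The core idea behind noise optimization here is that not every initial noise is equally good: you want one whose denoised output still resembles the original content. This toy NumPy sketch picks the best of random candidates against a stand-in one-step denoiser; the real method optimizes the noise with a pretrained diffusion model, so everything below is a simplified illustration:

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-in for the structure of an original frame (hypothetical,
# not the paper's actual latents).
reference = np.sin(np.linspace(0.0, 3.14, 64))

def one_step_denoise(noise):
    """Stand-in for one denoising step of a pretrained diffusion
    model: a simple moving average, purely for illustration."""
    kernel = np.ones(5) / 5.0
    return np.convolve(noise, kernel, mode="same")

# Search for an initial noise whose denoised output stays close to
# the reference structure (random search instead of gradient-based
# optimization, to keep the sketch short).
errors = []
best_noise, best_err = None, np.inf
for _ in range(500):
    candidate = rng.normal(size=reference.shape)
    err = float(np.mean((one_step_denoise(candidate) - reference) ** 2))
    errors.append(err)
    if err < best_err:
        best_noise, best_err = candidate, err
```

Starting generation from `best_noise` rather than arbitrary noise is what keeps the enhanced video consistent with the original.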