๐ŸŽจ JoyCaption - Advanced Image Captioning

MultimodalTextbox
0 1
0 1
8 4096

๐Ÿงช This application uses the JoyCaption model to generate high-quality, detailed captions for images.

Key Features:

  • ๐Ÿ†“ Free & Open: No restrictions, open weights, training scripts included
  • ๐Ÿ”“ Uncensored: Equal coverage of SFW and NSFW concepts
  • ๐ŸŒˆ Diversity: Supports digital art, photoreal, anime, furry, and all styles
  • ๐ŸŽฏ High Performance: Near GPT4o-level captioning quality
  • ๐Ÿ”ง Minimal Filtering: Trained on diverse images for broad understanding

Supported image formats: PNG, JPG, JPEG, WEBP

Caption Styles:

  • Formal Detailed: Long descriptive captions in formal tone
  • Creative: Artistic and expressive descriptions
  • Simple: Concise, straightforward descriptions
  • Technical: Detailed technical analysis of composition and elements
  • Custom: User-defined prompts for specialized captioning

Model: fancyfeast/llama-joycaption-alpha-two-hf-llava

Architecture: LLaVA with Llama 3.1 base