Adobe Firefly 2

paid
multimodal
Adobe Firefly 2 (2023) is a suite of generative AI models for creatives. Its Image 2 model creates high-quality images w...
Released: 2y 22d ago on 10/10/2023
Version: 2.0

AlexaTM 20B

paid
text
AlexaTM 20B (2022) is Amazon’s 20B-parameter multilingual seq2seq model. It achieves state-of-the-art performance on tas...
Released: 3y 10m ago on 01/01/2022
Version: 20B

AlphaCode

research
code
AlphaCode is a 41.4B-parameter transformer model for code generation (Feb 2022). Trained on public GitHub code, it gener...
Released: 3y 8m 25d ago on 02/07/2022
Version: 1.0

AlphaFold 3

open-source
protein-folding
AlphaFold 3 (announced May 2024) is DeepMind's latest protein folding model. It extends AlphaFold2 with new capabilities...
Released: 1y 5m 24d ago on 05/08/2024
Version: 3

BLOOM 176B

open-source
text
BLOOM (176B, 2022) is an open multilingual model trained by the BigScience workshop, supporting 46 languages and 13 prog...
Released: 3y 3m 20d ago on 07/12/2022
Version: 1.0

Claude 3.5 Sonnet

paid
text
Anthropic's Claude 3.5 Sonnet (2024) is a conversational AI model aligned using Anthropic's Constitutional AI approach.
Released: 1y 4m 12d ago on 06/20/2024
Version: 3.5
Pricing:
  • details: Anthropic API pricing

CLIP

open-weight
multimodal
OpenAI CLIP (2021) is a multimodal model trained to associate images with text captions. It was trained on 400 million i...
Released: 4y 8m 27d ago on 02/05/2021
Version: 1.0

Command A

open-weight
text
Cohere's Command A (111B) is an open-weight, high-efficiency model (2025) with a 256K context window and support for 23 ...
Released: 7m 19d ago on 03/13/2025
Version: 1.0
Pricing:
  • details: free

DALL·E 3

paid
image
OpenAI DALL·E 3 (2023) is a text-to-image model that creates high-quality images from natural language prompts. It was d...
Released: 2y 13d ago on 10/19/2023
Version: 3.0

DeepSeek V3

open-source
text
DeepSeek V3 is an open-source 671B-parameter LLM with 128K context. Trained on 14.8T tokens, it excels at reasoning and ...
Released: 7m 8d ago on 03/24/2025
Version: V3
Pricing:
  • details: free

Doubao 1.5-Pro

proprietary
text
Doubao 1.5-Pro is ByteDance's proprietary LLM that surpasses OpenAI's o1 on a Chinese math reasoning benchmark. It offer...
Released: 9m 10d ago on 01/22/2025
Version: 1.5-Pro
Pricing:
  • details: 2 CNY/million tokens (32K context) to 9 CNY/million (256K context)

ERNIE 3.5

proprietary
text
Baidu's ERNIE 3.5 (2023) is a Chinese LLM powering Ernie Bot, focusing on knowledge-intensive tasks.
Released: 2y 5m ago on 06/01/2023
Version: 3.5
Pricing:
  • details: Via Ernie Bot subscriptions

ERNIE-ViLG 2.0

open-source
image
ERNIE-ViLG 2.0 is Baidu's 24B-parameter text-to-image diffusion model. It generates high-quality images from Chinese tex...
Released: 2y 11m 25d ago on 11/07/2022
Version: 2.0
Pricing:
  • details: free

EuroLLM-9B

open-source
text
EuroLLM-9B (Dec 2024) is a 9B-parameter open-source LLM built to cover all 24 EU languages. Developed under EU Horizon f...
Released: 10m 30d ago on 12/02/2024
Version: 1.0

Falcon 180B

open-source
text
TII's Falcon 180B (2023) is an open 180B-parameter LLM trained on 3.5T tokens, ranking among the top public models.
Released: 2y 1m 26d ago on 09/06/2023
Version: 1.0
Pricing:
  • details: free

Flamingo

research
text+image+video
Flamingo (NeurIPS 2022) is DeepMind's visual-language model that processes images, videos, and text together. It bridges...
Released: 3y 6m 3d ago on 04/29/2022
Version: 1.0

Gemini 1.5 Pro

paid
text image video audio
Google's multimodal foundation model for text, audio, video, and image understanding with long-context reasoning.
Released: 1y 8m 17d ago on 02/15/2024
Version: 1.5-pro
Pricing:
  • tier: per-minute compute
  • currency: USD
  • details: Pricing not public; available via Vertex AI

GLM-4-Voice

open-source
text audio
Zhipu AI's GLM-4-Voice (2024) is an open-source end-to-end speech model supporting Chinese and English.
Released: 1y 7d ago on 10/25/2024
Version: 4.0
Pricing:
  • details: free

GLM-4.5

open-source
text
GLM-4.5 is an open-source LLM by Zhipu AI (355B params) released under MIT license. It achieves top-tier results (63.2 b...
Released: 3m 4d ago on 07/28/2025
Version: 4.5
Pricing:
  • details: free

GPT-4o

paid
text image audio
OpenAI's flagship multimodal model capable of reasoning across text, images, and audio in real time.
Released: 1y 5m 19d ago on 05/13/2024
Updated: 9m 22d ago on 01/10/2025
Version: 4o
Pricing:
  • input_per_1k_tokens: 0.005
  • output_per_1k_tokens: 0.015
  • currency: USD
  • subscription_available: true
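
The per-1K-token prices above can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost_usd` and the token counts are hypothetical, and it assumes the listed values are USD per 1,000 tokens and still current, so check the provider's pricing page before relying on them.

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      input_per_1k: float, output_per_1k: float) -> float:
    """Estimate the cost of one request from per-1K-token prices (USD)."""
    input_cost = input_tokens / 1000 * input_per_1k
    output_cost = output_tokens / 1000 * output_per_1k
    return input_cost + output_cost


# Prices taken from the listing above; token counts are made up for illustration.
cost = estimate_cost_usd(
    input_tokens=2000,
    output_tokens=500,
    input_per_1k=0.005,
    output_per_1k=0.015,
)
print(f"Estimated cost: ${cost:.4f}")
```

The same function applies to any entry that lists `input_per_1k_tokens` and `output_per_1k_tokens`; entries priced per million tokens would need the divisor adjusted accordingly.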

HunyuanVideo

open-source
video
HunyuanVideo is an open-source video generation model (13B params) from Tencent. It produces high-quality videos from te...
Released: 10m 29d ago on 12/03/2024
Version: 1.0
Pricing:
  • details: free

HunyuanWorld 1.0

open-source
multimodal
HunyuanWorld-1.0 (Tencent) is an open-source 3D world generation model. It builds interactive 3D environments from text ...
Released: 3m 6d ago on 07/26/2025
Version: 1.0
Pricing:
  • details: free

Kimi K2

open-source
text
Kimi K2 (Moonshot AI) is a 1T-parameter MoE LLM with a 128K context window. It achieves state-of-the-art coding performa...
Released: 3m 21d ago on 07/11/2025
Version: K2
Pricing:
  • details: free

Llama 2 70B

open-source
text
Meta's Llama 2 70B is an open-weight model released in July 2023 and available for research and commercial use.
Released: 2y 3m 14d ago on 07/18/2023
Version: 2
Pricing:
  • details: free

Luminous-supreme

commercial API
text
Aleph Alpha's Luminous-supreme is a 70B-parameter text LLM (released April 2022) with multilingual training. It uses a d...
Released: 3y 7m ago on 04/01/2022
Version: 1.0

Mistral 7B

open-source
text
Mistral 7B (2023) is an open-weight 7B-parameter model from Mistral AI, optimized for inference speed and cost efficiency.
Released: 2y 1m 5d ago on 09/27/2023
Version: 1.0
Pricing:
  • details: free

Mixtral 8x7B

open-source
text
Mixtral 8x7B is a 46.7B-parameter sparse Mixture-of-Experts LLM released in Dec 2023. It consists of eight 7B experts (u...
Released: 1y 10m 21d ago on 12/11/2023
Version: v0.1

MPT-7B

open-weight
text
MPT-7B (2023) is an open-source 6.7B-parameter language model. Trained on 1T tokens of text+code, it achieves performanc...
Released: 2y 5m 27d ago on 05/05/2023
Version: 1.0

MusicLM

research
audio
MusicLM (2023) is a Google research model for generating music from text descriptions. It uses a hierarchical sequence-t...
Released: 2y 3m ago on 08/01/2023
Version: 1.0

OpenAI Codex

paid
text-to-code
OpenAI Codex (2021) is a code generation AI derived from GPT-3 and fine-tuned on public code repositories. It can interp...
Released: 4y 2m 22d ago on 08/10/2021
Version: 1.0

OPT 175B

open-source
text
Meta's OPT-175B (2022) is an open 175B-parameter model released under a research license, comparable to GPT-3.
Released: 3y 4m 25d ago on 06/07/2022
Version: 1.0

PaLM 2

paid
text
Google's PaLM 2 (2023) is a next-gen LLM powering Bard, with enhanced multilingual and reasoning capabilities.
Released: 2y 5m 22d ago on 05/10/2023
Version: 2
Pricing:
  • details: See Google Cloud pricing

PaLM 2

paid
text
PaLM 2 (2023) is Google’s 340B-parameter language model. It was trained on extensive multilingual and code datasets (~78...
Released: 2y 5m 22d ago on 05/10/2023
Version: 2.0

Qwen3 (235B MoE)

open-source
text
Qwen3 is Alibaba's open-source LLM family. Its largest model has 235B parameters (22B active) and supports 119 languages. Trained on 36T to...
Released: 6m 3d ago on 04/29/2025
Version: 235B
Pricing:
  • details: free

Segment Anything Model (SAM)

open-weight
image
Meta AI’s Segment Anything Model (SAM, 2023) is an open-source image segmentation model. It was trained on Meta’s SA-1B ...
Released: 2y 6m 27d ago on 04/05/2023
Version: 1.0

SenseNova Unified Large Model (V6)

proprietary
multimodal
SenseTime's flagship multimodal LLM (600B params) integrates text and visual understanding. It achieves top scores on Ch...
Released: 9m 19d ago on 01/13/2025
Version: V6
Pricing:
  • details: Cloud/API access (enterprise licensing)

SparkDesk 4.0

proprietary
audio
SparkDesk 4.0 is iFlytek's flagship speech recognition model supporting 74 languages. It delivers high transcription acc...
Released: 1y 2m 17d ago on 08/15/2024
Version: 4.0
Pricing:
  • details: see website for pricing

Stable Audio Open

open-source
audio
Stable Audio Open (2023) is an open text-to-audio diffusion model by Stability AI. It produces high-quality stereo audio...
Released: 2y 1m 18d ago on 09/14/2023
Version: 1.0

Stable Diffusion XL 1.0

open-weight
image
Stable Diffusion XL (SDXL 1.0, July 2023) is a 3.5B-parameter image generation model. Using a two-stage pipeline (base +...
Released: 2y 3m 6d ago on 07/26/2023
Version: 1.0

Stable Video Diffusion

open-source
video
Stable Video Diffusion (2023) is Stability AI's video generation model based on Stable Diffusion. It generates short video ...
Released: 2y 2m ago on 09/01/2023
Version: 1.0

TildeOpen LLM

open-source
text
Tilde's TildeOpen is a 30B-parameter open-source LLM for European languages (released Sept 2025). Trained on a balanced ...
Released: 1m 3d ago on 09/29/2025
Version: 1.0

Titan Text Premier

paid
text
Amazon's Titan Text Premier (2024) is a 32K-context LLM on AWS Bedrock with strong QA and reasoning performance.
Released: 1y 5m 25d ago on 05/07/2024
Version: 1.0
Pricing:
  • input_per_1k_tokens: 0.0025
  • output_per_1k_tokens: 0.01
  • currency: USD
  • subscription_available: false

Whisper

open-weight
audio
OpenAI Whisper (2022) is an encoder-decoder Transformer ASR model trained on 680K hours of multilingual audio. It excels...
Released: 3y 1m 11d ago on 09/21/2022
Version: 1.0
