Adobe Firefly 2

paid

multimodal

Adobe Firefly 2 (2023) is a suite of generative AI models for creatives. Its Image 2 model creates high-quality images w...

Released: 2y 22d ago on 10/10/2023

Version: 2.0

AlexaTM 20B

Amazon

paid

text

AlexaTM 20B (2022) is Amazon’s 20B-parameter multilingual seq2seq model. It achieves state-of-the-art performance on tas...

Released: 3y 10m ago on 01/01/2022

Version: 20B

AlphaCode

DeepMind (Google)

research

code

AlphaCode is a 41.4B-parameter transformer model for code generation (Feb 2022). Trained on public GitHub code, it gener...

Released: 3y 8m 25d ago on 02/07/2022

Version: 1.0

AlphaFold 3

DeepMind (Google)

open-source

protein-folding

AlphaFold 3 (announced May 2024) is DeepMind's latest protein folding model. It extends AlphaFold2 with new capabilities...

Released: 1y 5m 24d ago on 05/08/2024

Version: 3

BLOOM 176B

BigScience

open-source

text

BLOOM (176B, 2022) is an open multilingual model trained by the BigScience workshop, supporting 46 languages and 13 prog...

Released: 3y 3m 20d ago on 07/12/2022

Version: 1.0

Claude 3.5 Sonnet

Anthropic

paid

text

Anthropic's Claude 3.5 Sonnet (2024) is an advanced conversational AI model built with constitutional alignment.

Released: 1y 4m 12d ago on 06/20/2024

Version: 3.5

Pricing:

details: Anthropic API pricing

CLIP

OpenAI

open-weight

multimodal

OpenAI CLIP (2021) is a multimodal model trained to associate images with text captions. It was trained on 400 million i...

Released: 4y 8m 27d ago on 02/05/2021

Version: 1.0

Command A

Cohere

open-weight

text

Cohere's Command A (111B) is an open-weight, high-efficiency model (2025) with a 256K context window and support for 23 ...

Released: 7m 19d ago on 03/13/2025

Version: 1.0

Pricing:

details: free

DALL·E 3

OpenAI

paid

image

OpenAI DALL·E 3 (2023) is a text-to-image model that creates high-quality images from natural language prompts. It was d...

Released: 2y 13d ago on 10/19/2023

Version: 3.0

DeepSeek V3

DeepSeek AI

open-source

text

DeepSeek V3 is an open-source 671B-parameter LLM with 128K context. Trained on 14.8T tokens, it excels at reasoning and ...

Released: 7m 8d ago on 03/24/2025

Version: V3

Pricing:

details: free

Doubao 1.5-Pro

ByteDance

proprietary

text

Doubao 1.5-Pro is ByteDance's proprietary LLM that surpasses OpenAI's o1 on a Chinese math reasoning benchmark. It offer...

Released: 9m 10d ago on 01/22/2025

Version: 1.5-Pro

Pricing:

details: 2 CNY/million tokens (32K context) to 9 CNY/million (256K context)

ERNIE 3.5

Baidu

proprietary

text

Baidu's ERNIE 3.5 (2023) is a Chinese LLM powering Ernie Bot, focusing on knowledge-intensive tasks.

Released: 2y 5m ago on 06/01/2023

Version: 3.5

Pricing:

details: Via Ernie Bot subscriptions

ERNIE-ViLG 2.0

Baidu

open-source

image

ERNIE-ViLG 2.0 is Baidu's 24B-parameter text-to-image diffusion model. It generates high-quality images from Chinese tex...

Released: 2y 11m 25d ago on 11/07/2022

Version: 2.0

Pricing:

details: free

EuroLLM-9B

EuroLLM consortium

open-source

text

EuroLLM-9B (Dec 2024) is a 9B-parameter open-source LLM built to cover all 24 EU languages. Developed under EU Horizon f...

Released: 10m 30d ago on 12/02/2024

Version: 1.0

Falcon 180B

Technology Innovation Institute (TII)

open-source

text

TII's Falcon 180B (2023) is an open 180B-parameter LLM trained on 3.5T tokens, ranking among the top public models.

Released: 2y 1m 26d ago on 09/06/2023

Version: 1.0

Pricing:

details: free

Flamingo

DeepMind (Google)

research

text+image+video

Flamingo (NeurIPS 2022) is DeepMind's visual-language model that processes images, videos, and text together. It bridges...

Released: 3y 6m 3d ago on 04/29/2022

Version: 1.0

Gemini 1.5 Pro

Google DeepMind

paid

text image video audio

Google's multimodal foundation model for text, audio, video, and image understanding with long-context reasoning.

Released: 1y 8m 17d ago on 02/15/2024

Version: 1.5-pro

Pricing:

tier: per-minute compute
currency: USD
details: Pricing not public; available via Vertex AI

GLM-4-Voice

Zhipu AI (Z.ai)

open-source

text audio

Zhipu AI's GLM-4-Voice (2024) is an open-source end-to-end speech model supporting Chinese and English.

Released: 1y 7d ago on 10/25/2024

Version: 4.0

Pricing:

details: free

GLM-4.5

Zhipu AI (Z.ai)

open-source

text

GLM-4.5 is an open-source LLM by Zhipu AI (355B params) released under MIT license. It achieves top-tier results (63.2 b...

Released: 3m 4d ago on 07/28/2025

Version: 4.5

Pricing:

details: free

GPT-4o

OpenAI

paid

text image audio

OpenAI's flagship multimodal model capable of reasoning across text, images, and audio in real time.

Released: 1y 5m 19d ago on 05/13/2024

Updated: 9m 22d ago on 01/10/2025

Version: 4o

Pricing:

input_per_1k_tokens: 0.005
output_per_1k_tokens: 0.015
currency: USD
subscription_available: true

HunyuanVideo

Tencent Hunyuan

open-source

video

HunyuanVideo is an open-source video generation model (13B params) from Tencent. It produces high-quality videos from te...

Released: 10m 29d ago on 12/03/2024

Version: 1.0

Pricing:

details: free

HunyuanWorld 1.0

Tencent Hunyuan

open-source

multimodal

HunyuanWorld-1.0 (Tencent) is an open-source 3D world generation model. It builds interactive 3D environments from text ...

Released: 3m 6d ago on 07/26/2025

Version: 1.0

Pricing:

details: free

Kimi K2

Moonshot AI

open-source

text

Kimi K2 (Moonshot AI) is a 1T-parameter MoE LLM with a 128K context window. It achieves state-of-the-art coding performa...

Released: 3m 21d ago on 07/11/2025

Version: K2

Pricing:

details: free

Llama 2 70B

Meta AI

open-source

text

Meta's open-weight Llama 2 (70B) model released in July 2023, available for research and commercial use.

Released: 2y 3m 14d ago on 07/18/2023

Version: 2

Pricing:

details: free

Luminous-supreme

Aleph Alpha

commercial API

text

Aleph Alpha's Luminous-supreme is a 70B-parameter text LLM (released April 2022) with multilingual training. It uses a d...

Released: 3y 7m ago on 04/01/2022

Version: 1.0

Mistral 7B

Mistral AI

open-source

text

An open-weight 7B parameter model optimized for inference and cost efficiency.

Released: 2y 1m 5d ago on 09/27/2023

Version: 1.0

Pricing:

details: free

Mixtral 8x7B

Mistral AI

open-source

text

Mixtral 8x7B is a 46.7B-parameter sparse Mixture-of-Experts LLM released in Dec 2023. It consists of eight 7B experts (u...

Released: 1y 10m 21d ago on 12/11/2023

Version: v0.1

MPT-7B

MosaicML

open-weight

text

MPT-7B (2023) is an open-source 6.7B-parameter language model. Trained on 1T tokens of text+code, it achieves performanc...

Released: 2y 5m 27d ago on 05/05/2023

Version: 1.0

MusicLM

Google

research

audio

MusicLM (2023) is a Google research model for generating music from text descriptions. It uses a hierarchical sequence-t...

Released: 2y 3m ago on 08/01/2023

Version: 1.0

OpenAI Codex

OpenAI

paid

text-to-code

OpenAI Codex (2021) is a code generation AI derived from GPT-3 and fine-tuned on public code repositories. It can interp...

Released: 4y 2m 22d ago on 08/10/2021

Version: 1.0

OPT 175B

Meta AI

open-source

text

Meta's OPT-175B (2022) is an open 175B-parameter model released under a research license, comparable to GPT-3.

Released: 3y 4m 25d ago on 06/07/2022

Version: 1.0

PaLM 2

Google

paid

text

Google's PaLM 2 (2023) is a next-gen LLM powering Bard, with enhanced multilingual and reasoning capabilities.

Released: 2y 5m 22d ago on 05/10/2023

Version: 2

Pricing:

details: See Google Cloud pricing

PaLM 2

Google

paid

text

PaLM 2 (2023) is Google’s 340B-parameter language model. It was trained on extensive multilingual and code datasets (~78...

Released: 2y 5m 22d ago on 05/10/2023

Version: 2.0

Qwen3 (235B MoE)

Alibaba Cloud

open-source

text

Qwen3 is Alibaba's open-source LLM family. Its largest 235B-model (22B active) supports 119 languages. Trained on 36T to...

Released: 6m 3d ago on 04/29/2025

Version: 235B

Pricing:

details: free

Segment Anything Model (SAM)

SenseNova Unified Large Model (V6)

SenseTime

proprietary

multimodal

SenseTime's flagship multimodal LLM (600B params) integrates text and visual understanding. It achieves top scores on Ch...

Released: 9m 19d ago on 01/13/2025

Version: V6

Pricing:

details: Cloud/API access (enterprise licensing)

SparkDesk 4.0

iFlytek

proprietary

audio

SparkDesk 4.0 is iFlytek's flagship speech recognition model supporting 74 languages. It delivers high transcription acc...

Released: 1y 2m 17d ago on 08/15/2024

Version: 4.0

Pricing:

details: see website for pricing

Stable Audio Open

Stability AI

open-source

audio

Stable Audio Open (2023) is an open text-to-audio diffusion model by Stability AI. It produces high-quality stereo audio...

Released: 2y 1m 18d ago on 09/14/2023

Version: 1.0

Stable Diffusion XL 1.0

Stability AI

open-weight

image

Stable Diffusion XL (SDXL 1.0, July 2023) is a 3.5B-parameter image generation model. Using a two-stage pipeline (base +...

Released: 2y 3m 6d ago on 07/26/2023

Version: 1.0

Stable Video Diffusion

Stability AI

open-source

video

Stable Video Diffusion (2023) is Stability AI's text-to-video model based on Stable Diffusion. It generates short video ...

Released: 2y 2m ago on 09/01/2023

Version: 1.0

TildeOpen LLM

Tilde

open-source

text

Tilde's TildeOpen is a 30B-parameter open-source LLM for European languages (released Sept 2025). Trained on a balanced ...

Released: 1m 3d ago on 09/29/2025

Version: 1.0

Titan Text Premier

Amazon Web Services

paid

text

Amazon's Titan Text Premier (2024) is a 32K-context LLM on AWS Bedrock with strong QA and reasoning performance.

Released: 1y 5m 25d ago on 05/07/2024

Version: 1.0

Pricing:

input_per_1k_tokens: 0.0025
output_per_1k_tokens: 0.01
currency: USD
subscription_available: false

Whisper

OpenAI

open-weight

audio

OpenAI Whisper (2022) is an encoder-decoder Transformer ASR model trained on 680K hours of multilingual audio. It excels...

Released: 3y 1m 11d ago on 09/21/2022

Version: 1.0

Join our community

LinkedIn

Facebook

YouTube

Twitter

Instagram

TikTok