4

第四章

AI 工具
全景图

一份你现在就能用的
AI 工具实战指南。

LAST UPDATED MARCH 2026

大多数人说"AI"的时候,其实是在说 ChatGPT。这就好比说"互联网"但其实只是在说 Google。现在,有数百种 AI 工具你今天就能用——能用一句话生成逼真的图片、谱出完整的歌曲、或者阅读你研究课题的每一篇论文然后用三段话总结出学术共识。这一章就是你的地形图。

八大家族

把它想象成乐器。钢琴和架子鼓都是"乐器",但你不会让钢琴家去打鼓。AI 工具也是一样——专门做图像生成的工具和专门做代码的工具本质上完全不同。你会遇到的每一个工具,都属于这八大家族之一。

1

图像生成

输入文字,输出图片。Midjourney、DALL-E 3、Stable Diffusion、Flux、Ideogram。

2

图像编辑

从一张真实图片开始做变换。Photoshop AI、Magnific、Clipdrop。

3

视频创作

用文字或图片生成视频片段。Sora、Runway、Kling、Pika。

4

音乐与音频

用提示词生成完整歌曲、声音克隆、音效。Suno、Udio、ElevenLabs。

5

研究助手

AI 替你阅读互联网。Perplexity、Elicit、Consensus、NotebookLM。

6

AI 浏览器

像人一样浏览网页的智能体。Arc、Operator、Claude Computer Use。

7

编程工具

能写代码、改代码、调试代码的 AI。Claude Code、Cursor、Antigravity、Copilot。

8

聊天机器人与助手

你已经熟悉的通用对话式 AI:ChatGPT、Claude、Gemini。

核心洞察

浅浅了解10个 AI 工具的人,会持续碾压只深入掌握一个工具的人。专业木匠没有"最爱的工具"——他们对每个任务都有最合适的工具。

浏览完整目录。点击任何工具了解它的用途。

The Tool Wall

24 tools across 10 categories

Image GenPaid

Midjourney v8

The gold standard for artistic, cinematic image quality

click for details
Image GenImage EditFreemium

Nano Banana 2

Google's powerhouse — fastest gen, best text, seamless editing

click for details
Image GenImage EditFreemium

Luma Uni-1

Reasoning-first image model — thinks before it renders, #1 for editing

click for details
Image EditFreemium

Photoroom

Instant studio-quality product photography from phone photos

click for details
VideoFreemium

Seedance 2.0

Best motion consistency and cinematic storytelling

click for details
VideoFreemium

Luma Ray3

Most cinematic — first HDR video gen, start/end frame control

click for details
VideoFreemium

Hailuo / MiniMax

Best human motion and facial expressions, most affordable

click for details
VideoFreemium

Kling 2.6

Longest AI videos (2 min), generous free tier, very affordable

click for details
MusicFreemium

Suno

Full songs with vocals, instruments, and structure from a prompt

click for details
MusicFreemium

Google Lyria

Google DeepMind's music model — vocals, lyrics, and instrumentals from text or image prompts

click for details
Voice & AudioFreemium

ElevenLabs

Most realistic voice cloning and text-to-speech, period

click for details
ResearchFreemium

Perplexity

AI search engine — answers with cited sources in real time

click for details
ResearchFree

NotebookLM

Upload your docs, get AI grounded only in your sources

click for details
Browsers & AgentsFree

Dia

AI-first browser — your URL bar is now an assistant

click for details
Browsers & AgentsPaid

Claude Computer Use

Claude controls your desktop — opens apps, navigates, types

click for details
CodingFreemium

Cursor

Best AI code editor — codebase-wide reasoning in a visual IDE

click for details
CodingPaid

Claude Code

Terminal AI agent — reads, writes, runs, and debugs your code

click for details
AggregatorsFreemium

Krea AI

Creative cockpit — image, video, 3D, upscaling from top models

click for details
AggregatorsFreemium

Poe

200+ AI models in one interface — text, image, video, audio

click for details
AggregatorsFreemium

Genspark

AI workspace — routes tasks across 9+ models that cross-check each other

click for details
AggregatorsFreemium

OpenRouter

One API key for 100+ models from every provider

click for details
OtherFree

Google Stitch

Prompt-to-prototype — generates real UI from text or sketches

click for details
OtherFree

Google Pomelli

AI marketing agency in a box — campaigns from your website URL

click for details
OtherFreemium

Descript

Edit audio and video by editing text — like a doc, not a timeline

click for details

This landscape changes fast. New tools appear every week.

Share this course
AI 版图每个月都在变。但类别是稳定的,即使工具在变——这才是值得记住的地图。

到目前为止,这些工具只是在回应你。如果它们能自己采取行动呢?这就是从工具到智能体的飞跃——它改变了一切。

Dream Project

New tool unlocked!