🔬 What AI Studio actually is
Google AI Studio (aistudio.google.com) is a free browser-based environment for working with Gemini models. It's not a product — it's a laboratory. You're talking directly to the model, with full control over the system prompt, temperature, context, and which model version you use.
It's different from Gemini.google.com (Google's consumer chatbot) in the same way a kitchen is different from a restaurant. Gemini.google.com serves you a meal. AI Studio puts you behind the stove.
Why this matters: Most AI tools put a layer of product between you and the model. AI Studio removes that layer. You can see exactly what's being sent to the model, tune the parameters, and get an API key to use the same model in your own tools — all for free.
The main model to know right now is Gemini 2.0 Flash: fast, free (with generous limits), multimodal (images, audio, video, PDFs), and genuinely capable. It's the right model for most things you'll build here.
🗺️ The interface — five things worth knowing
🖼️ Multimodal — what you can actually do
Gemini's multimodal capabilities are genuinely useful beyond novelty. Here are the practical use cases:
- Invoice and receipt reading — photograph a paper receipt, ask Gemini to extract the line items into a structured list. Useful for accounts without scanning software.
- Document summarisation — upload a PDF (council report, contract, lease, creative brief) and ask for a 5-point summary with key decisions highlighted.
- Image description for accessibility — paste in an image, ask for an alt-text description. Useful for anyone building websites or social media posts that need accessibility.
- Whiteboard and handwritten notes — photograph your planning notes and ask Gemini to turn them into a structured document. The OCR quality on handwriting is surprisingly good.
- Audio transcription — upload an audio file (up to ~1 hour for Gemini 1.5 Pro) and ask for a transcript. Faster and cheaper than dedicated transcription tools for one-off jobs.
Privacy note: What you upload to AI Studio goes to Google's servers. Don't upload sensitive personal information, client data, or anything you wouldn't want processed by a third party. For sensitive documents, use a local model (Ollama + Llama) or read the data privacy terms carefully.
🔑 Getting and using your API key
An API key lets you call Gemini from code — your own scripts, tools you build, integrations with other apps. Getting one takes 30 seconds.
import google.generativeai as genai
genai.configure(api_key="YOUR_API_KEY_HERE")
model = genai.GenerativeModel("gemini-2.0-flash")
response = model.generate_content("Summarise what vibe coding is in 3 sentences.")
print(response.text)
Free tier limits: Gemini 2.0 Flash on the free tier gives you 15 requests/minute and 1 million tokens/day. For most personal and small-business uses, you will never hit these limits. Paid tier (via Google Cloud) starts at $0.075 per 1M input tokens — very cheap for production use.
⚖️ AI Studio vs Claude vs ChatGPT — when to use which
Practical rule: Use AI Studio when you're building or need an API. Use Claude when the writing quality matters and you need it to follow precise instructions. Use ChatGPT when you need images alongside text, or when you're helping someone less technical who already has an account.
🎙️ NotebookLM — Google's secret weapon
NotebookLM (notebooklm.google.com) is a separate Google product built on Gemini, but it deserves a section here because it's one of the most immediately useful AI tools for businesses — and most people haven't heard of it.
The core idea: you upload your documents (PDFs, Google Docs, web pages, YouTube transcripts), and NotebookLM becomes an expert on those specific documents. It won't hallucinate facts from outside your sources — it stays grounded in what you gave it. Then it can do something remarkable: turn your documents into a podcast.
The Audio Overview feature: Give NotebookLM any set of documents and click "Generate Audio Overview." Two AI hosts will have a natural, conversational discussion about the key ideas — complete with analogies, back-and-forth, and summary. It sounds eerily real. A 40-page PDF becomes a 12-minute commute listen.
Real NZ business use cases:
- Staff onboarding — upload your policies, procedures, and FAQs. New staff explore using chat instead of reading every PDF. Generate an audio briefing for the first week.
- Client education — convert a proposal or service document into a "here's what you're getting" podcast that clients actually listen to
- Internal knowledge base — staff ask questions and get answers grounded only in your company's actual documents, not the internet
- Content repurposing — turn a blog post, report, or email newsletter into a podcast episode for low-effort audio content
- Meeting prep — upload the agenda and background docs; ask "what are the three things I need to know before this meeting?"
Why it's better than just asking ChatGPT: NotebookLM cites sources — every answer links back to the specific passage in your document. You can verify it's not making things up. For legal, financial, or compliance documents, this is the only responsible way to use AI as a research tool.
Go to notebooklm.google.com (free, needs a Google account). Upload one document — a proposal, your website, a services PDF. Then:
- Ask it a question in the chat — "What are the key services offered?" or "What should a new client know?"
- Click Audio Overview → Generate. Wait 2–3 minutes.
- Listen to the result. Notice the conversational tone.
For B2B use: NotebookLM is particularly powerful when you have a specific knowledge base to ground it in. A law firm's FAQ, a hospitality group's menu and policies, a school's parent handbook — the tool shines when the source material is defined.
👇 Your exercise
Pick one of these real use cases. Open AI Studio, write the system instruction, and test it.
- Email reply assistant — System instruction: "You are a professional email assistant. When given an email, write a concise, warm, professional reply in NZ English. Don't start with 'I hope this finds you well.'" Then paste in a real email you need to reply to.
- Menu description writer — System instruction: "You write enticing, honest menu descriptions. Under 30 words each. No food clichés (no 'succulent', 'artisan', 'decadent'). Warm and specific." Then give it your dishes.
- Document Q&A — Upload a PDF (your lease, a council document, a contract). Ask: "What are the three most important things I need to know in this document? Are there any deadlines or action items?"
Once it works: click "Get code" and save the Python snippet. That's the start of a real tool.
Branch complete 🔬
AI Studio is the clearest path from "I tried the chatbot" to "I built something." The API key is the unlock — once you have it, you can connect AI to anything.