Best for
Unit 7 introductory lesson — before students use any AI tools. Sets the conceptual and ethical foundation for the rest of the unit.
Digital Technologies & AI Ethics • Unit 7 • Years 9–11 • Concepts
What is an LLM, and how does it actually work? Understanding the mechanics behind AI language tools is the first step toward using them responsibly — and evaluating them critically.
Want this adapted for your school's specific AI tools or local context? Te Wānanga can localise it — adding examples from the digital tools your students actually use and connecting to community digital sovereignty initiatives.
All content is provided. No supplementary materials needed to run this lesson.
This handout develops students' ability to understand digital systems and evaluate their societal impact — connecting to the NZ Curriculum's Digital Technologies strand and the broader values of critical thinking, equity, and cultural identity.
In Māori worldview, reo is a taonga — a treasure. Language carries culture, identity, and knowledge across generations. AI language models are trained almost entirely on English-language internet data. This means they often cannot handle te reo Māori well, may reproduce colonial framings of Aotearoa's history, and may generate content that misrepresents or flattens mātauranga Māori. Understanding how LLMs work is not just a tech skill — it's a foundation for digital tino rangatiratanga.
A Large Language Model is a type of artificial intelligence trained on a massive amount of text data. Its core function is predicting the next most likely word in a sequence — which enables it to generate human-like text, answer questions, translate languages, and much more.
LLMs are trained on vast datasets — internet text, books, articles. This is how they "learn" language patterns. But the data may not represent all languages, cultures, or perspectives equally. What voices are missing?
Internal variables the model uses to make predictions. Models are measured by parameter count — from millions to hundreds of billions. More parameters does not automatically mean better or more culturally aware output.
The input you give — your question or instruction. The clarity and specificity of your prompt significantly affects the quality and reliability of the output you receive.
You type a question or prompt
The model breaks your text into smaller pieces called tokens
The model uses its parameters to understand context and predict likely responses
It predicts and outputs the most statistically likely response — not necessarily the most accurate or culturally aware
LLMs reflect the biases in their training data. Indigenous knowledge and non-English perspectives are frequently underrepresented.
Training and running large models requires enormous computational energy. This has a real environmental cost.
How well do these models understand and respect mātauranga Māori and other indigenous knowledge systems? Often poorly.
What happens to data you share? Data sovereignty — the right of communities to control their own data — is a live issue for Māori.
Answer in your own words. Use the spaces below — do not use an AI tool to answer questions about AI tools.
1. How might the training data of an LLM affect its understanding of Māori perspectives and mātauranga Māori?
2. What are two potential benefits and two potential risks of using LLMs in education?
3. In te ao Māori, reo (language) is a taonga. What does this suggest about how we should treat AI-generated text in te reo Māori?
4. What three questions would you ask before trusting information from an AI model?
Complete questions 1 and 2. Draw or label the 4-step process flow. Discuss your answers with a partner before writing.
Complete all four reflection questions with specific examples where possible. Aim for 3–4 sentences per answer.
Research a real documented case of AI bias or indigenous language underrepresentation. Write a paragraph connecting it to the concepts here and bring it to the next lesson.
Level 4–5: Understand how digital systems work; explore how AI models are trained on data; evaluate the implications of AI-generated text for accuracy, bias, and cultural representation.
Level 3–4: Understand how technology shapes relationships, power, and identity within communities; evaluate the impacts of digital innovation on society and culture.
In te ao Māori, reo is a taonga — language carries culture, whakapapa, and tikanga across generations. Large language models are trained overwhelmingly on English-language data, which means te reo Māori is underrepresented and often mishandled. Understanding how LLMs work is foundational to digital tino rangatiratanga: the right of Māori communities to govern, protect, and define how their language and knowledge appear in AI systems. Kaitiakitanga of language is not a metaphor — it is a live responsibility in the age of AI.
Students will develop critical digital literacy by examining the ethical dimensions of AI systems, exploring how kaupeka matihiko (digital technologies) reflect and shape our values, and connecting concepts of tino rangatiratanga (self-determination) to digital sovereignty and data rights in Aotearoa.
Scaffold support: Provide worked examples of AI bias scenarios with entry-level sentence starters. Offer extension tasks requiring students to research and present a case study of algorithmic injustice affecting indigenous communities.
ELL / ESOL: Pre-teach key digital technology vocabulary (algorithm, bias, data, sovereignty). Allow students to discuss concepts in home language before writing in English.
Inclusion: Use accessible formats with clear headings and visual supports. Neurodiverse learners benefit from structured ethical frameworks (e.g. decision trees) to navigate complex AI ethics scenarios.
Mātauranga Māori lens: Connect AI ethics to tikanga Māori values — particularly kaitiakitanga of data (who owns and controls information about Māori communities) and the principle of manaakitanga in how technologies should serve people equitably. Discuss the risks of algorithmic bias replicating colonial harm.
Prior knowledge: Best used after introductory digital technology concepts. No specialist prior knowledge required for entry-level engagement.