What Working Memory Actually Means for Dance Teachers
- Mar 26
- 6 min read
You have probably seen it happen a hundred times. You teach a combination, step by step, with plenty of repetition. Students nod. They try it. And then you add one more eight-count and the first section vanishes. This does not happen because they were not paying attention or because your students do not care, but because their working memory ran out of room.
Understanding working memory is not an academic exercise. It is one of the most practical things a dance teacher can know, because it explains so much of what happens in the studio every single day.
If you have been reading this blog for a while, you have been meeting working memory all along, just without the formal introduction. The post on chunking is about how to package information so working memory can handle it. The posts on retrieval cues explain how spatial anchors reduce the cognitive load of remembering where you are in a phrase. The post on prior knowledge describes how long-term memory frees up working memory space for new learning, and the post on transfer of learning is really about what happens when working memory is no longer doing the heavy lifting at all. This post is the foundation underneath all of those writings. Think of it as going back to the beginning so that everything else makes a little more sense.
So What Is Working Memory, Exactly?
Working memory is the part of your cognitive system that holds and manipulates information in the present moment. Think of it as the mental workspace where active thinking happens. It is where you hold the phrase you are currently learning while also watching yourself in the mirror and listening to the music and tracking where your partner is in space.
The research of cognitive psychologist George Miller (1956) established that working memory has a limited capacity, famously described as seven items, plus or minus two. Later work by Baddeley and Hitch (1974) refined this further, showing that working memory is not a single container but a system with distinct components, including a space for verbal information, a separate space for visual and spatial information, and a central executive that manages attention between them.
For dance teachers, this matters enormously. When a student is processing verbal instruction, tracking their body in space, following musical cues, and monitoring the dancers around them, they are drawing on all of these systems at once. The load is real, and it has a ceiling.
What Happens When Working Memory Overloads
When the demands on working memory exceed its capacity, something gets dropped. Sometimes, it is whatever came earliest. This is why students can execute the ending of a combination brilliantly but cannot tell you how it began.
Additionally, students may be able to move through the beginning of a piece with ease yet then stumble through the ending. Again, this is due to working memory holding and computing the first dance combinations taught and not having room for more material. It is not a motivation problem or an attention problem. It is a capacity problem.
Cognitive load theory, developed by John Sweller (1988), describes this as cognitive overload. When instruction introduces too much information too quickly, without adequate structure or scaffolding, the system cannot process it all. New learning does not make it into long-term memory because there was not enough working memory space to do the encoding work in the first place.
For dance teachers, cognitive overload can look like: students who blank mid-combination, dancers who learn phrases in isolation but lose them once the music adds another layer, or groups who seem to regress after a correction. None of these are character flaws. They are working memory doing exactly what working memory does.
Long-Term Memory Is Where We Are Actually Trying to Go
Working memory and long-term memory are not the same thing, and the relationship between them is everything in learning.
Long-term memory has essentially unlimited capacity. When we have learned something deeply, it no longer takes up working memory space because it has been encoded into long-term memory as a chunk. An experienced ballet or jazz dancer does not use working memory to remember that a pas de bourree has three weight shifts. That information lives in long-term memory and gets retrieved automatically, freeing working memory to attend to quality and artistry.
The goal of teaching, then, is to move material from working memory into long-term memory through repeated, varied, and meaningful engagement. This is not just about drilling. Retrieval practice, spaced repetition, and connecting new material to what students already know all accelerate this transfer.
This is also why prior knowledge matters so much – including one’s social and cultural “funds of knowledge,” yet that is another longer blog. A student who already has ballet vocabulary encoded in long-term memory can learn a new ballet-influenced combination with far less working memory demand than a student who is building that vocabulary from scratch. They are not smarter or more talented. They simply have more cognitive resources available for the new information.
What This Means for How You Teach
Once you understand working memory, certain studio practices start to make a lot more sense, and others start to look like unnecessary obstacles.
Teach in chunks, not streams. Rather than teaching an entire phrase from start to finish, build it in small, manageable units that students can encode before you add more. This is the principle behind chunking, and it is one of the most evidence-supported practices in learning science. When chunks are mastered individually, they take up far less working memory when combined.
Use spatial anchors and retrieval cues. Visual and spatial information occupies a different part of working memory than verbal information. When you give students a physical location in the room, a spot on the floor, a directional marker, something to see and orient to, you are distributing the cognitive load across multiple systems rather than piling everything into one. This is the core science behind the Dance Dots system: spatial cues act as retrieval anchors that reduce the working memory demand of remembering where in the phrase you are.
Slow down before you speed up. Running material at full tempo before it is encoded puts enormous pressure on working memory. Students are using cognitive resources just to keep up with the speed, which leaves less capacity for actual learning. Practicing slowly and correctly encodes the pattern into long-term memory. Speed comes from retrieval, not from rushing encoding. (There are apps to help you slow down your music tempo as well!)
Give working memory a moment to breathe. Pauses, reflection prompts, and even brief rests between teaching segments give the cognitive system time to consolidate what it just received. What feels like downtime is actually essential processing time.
Connect new material to what students already know. When you anchor new choreography to vocabulary students have already encoded in long-term memory, again - including vocabulary outside the dance style and dance language you are teaching, you reduce the working memory demand of the new material. It is not just good pedagogy to say, remember that port de bras we did last month? This is similar. It is working memory management.
The Teacher's Working Memory Too
It is worth naming that you are managing your own working memory while you teach. You are tracking multiple students, monitoring music timing, adjusting on the fly, holding the rest of the class plan in mind, and responding to what is happening in the room. This is a significant cognitive load.
Routines, consistent class structures, and prepared materials reduce the working memory demand on you as the teacher, which frees your attention for what requires it: seeing your students clearly and responding to what they need in the moment.
What You Are Actually Doing When You Teach Well
Every time you break a combination into sections before building it, you are managing cognitive load. Every time you use a count, a spatial marker, or a movement cue to help students locate themselves in a phrase, you are creating a retrieval anchor. Every time you connect new vocabulary to something familiar, you are reducing the working memory burden of new learning.
You have probably been doing versions of this intuitively for years. The research just gives language to what your best teaching instincts already understood - learning happens when there is enough cognitive space for it to take root.
Working memory is not a limitation to work around. It is a system to work with, and once you understand how it functions, everything from how you sequence instruction to how you use space in the studio starts to look a little different.
Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63(2), 81-97.
Baddeley, A. D., & Hitch, G. (1974). Working memory. In G. H. Bower (Ed.), The psychology of learning and motivation (Vol. 8). Academic Press.
Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257-285.




Comments