Stage 1:
By giving out names for the objects in your territory, you're marking yourself with said objects (in memory), and when you've given out names for all objects that makes up your territory, that was when you're marked with the territory itself, always carrying it with you.
Stage 1 is building virtual territory/worldview.
Stage 2:
Since virtual territory/worldview is always with you, you can recall any objects of your territory just by pronouncing its name. They're available all the time. In virtual territory/worldview you can pay attention to a particular objects and their properties by pronouncing its name. You become AWARE of particular objects in this virtual territory/worldview.
Stage 2 is the building of awareness.
It's also the stage where you add additional marks to the objects for their properties, giving out meta for names.
In analogy of reading webnovels, it's like giving out titles for novels and giving out tags for said titles.
Stage 3:
You're the ruler of your territory, the constant that always interacts with various objects in the territory, but while all objects are available all the time, they're virtual objects with properties.
This means that when you interacts with said objects you don't have to deal with the whole objects but just its known properties, freeing yourself out of that is not necessary and allows you to interacts with more objects.
The most important point was that the interactions happened in virtual territory/worldview. In other world in your mind. This necessitates your own representative in your own mind, so you named yourself as subject in the virtual territory, you make virtual self.
Stage 3 is the building of self awareness.
Stage 4:
Stage 4 is giving out meta the virtual self for your own properties, completing your own understanding of yourself.
Stage 5:
Stage 5 is giving out names to the actions you do to your objects.
So now you're done with Subjects, Predicates and Objects and you're able to simulate actions and interactions between subjects and objects in your mind.
Stage 6:
Since you've become able to simulate actions and interactions, to convey the simulation to other subjects, to communicate with other human, you smoothed out the simulation so that other subjects can simulate said simulation in their own virtual territory/worldview/their mind.
Stage 6 is creating language.
Stage X: Education
Education is about transferring what simulation works to other human. But when it's done to children by giving out template of your own worldview to said children, isn't that like marking them with your own worldview?
It seems like sentience and civilisation is all about marking and re-marking the objects in your territory.