Mark tM - Oct 12, 2011:
Interesting thread so far
Ok Genesis, let me try to summarize phase 1. You will try to find patterns in combinations of letters, words and sentence-fragments. These will mostly be N-grams, so a certain combination of letters or words, correct?
Correct.
Mark tM - Oct 12, 2011:
If such an N-gram is perceived often enough, it becomes a ‘concept’ as you call it, but basically it means that the chance of perceiving this N-gram has passed a certain theshold.
Yep.
Mark tM - Oct 12, 2011:
One problem I spot here is data storage. Storing all possible combinations of letters, characters and words plus their chance of occurrence is a humongous job.
Not quite, the memory component of the system is not a typical storage database. It won’t store all combinations of letters. It will only store 26 objects, one being each letter of the alphabet. The combinations are links that connect to other objects its related to. So a link can be generated for the object “w” that connects to the objects “o”, “r” and “d”.
Each time those combination of letters occur, the link is strengthened. If it passes a threshold then a new concept is created based on the link as “word”. If it doesn’t pass the threshold then it will eventually be forgotten.
The combination of words (concepts) work in similar fashion. Words used together often are brought closer in the memory to form a cluster of words with each word in the cluster probably having its own cluster of words.
So we have a cluster of concepts with clusters of concepts. Think of it like the universe, which has galaxies, which in itself are clusters of stars and clusters of galaxies (galaxy groups). What brings these concepts together? The memory manager (See Pg. 1) Think of it as an outside gravity force, pulling concepts together and also capable to pull them back apart. These clustering will prove to be useful.
And its really not about chances of occurring, its how often a pattern occurs. You learn things by repetition. If something happens once, you will likely forget it. If something happened multiple times, it sticks some where deep in the recess of your mind.
Mark tM - Oct 12, 2011:
But let’s continue. After doing this, and feeding it training data to teach the system all the N-grams and chance, then what? You now have a database which tells you what kind of combinations occur with what chance. As Hunt already said, this is actually exactly what Google does. You say it’s different, because it does not understand words such as `Howesedfaboutttyrlnowfrt’ but that is because it is not trained to do so. It was not created to extract patterns and words from words like this. However, and this is the important part, with all the data that is in Google’s database, they could do this, because all the required information is in the database. It’s only a matter of statistics and calculating likelihoods of events.
The memory doesn’t deal with chance or statistics, its deals solely on how strong a concept and how close it is to other concepts and object. Chance and statistics is of itself a modeled concept in the brain, which is learned. But we can agree to disagree about what Google does and how its data in its DB is stored.
But there are other components. Google just does search and retrieval, “what” brings up “what is my ip”
There is no thinking (which is using the results to search the memory again for additional results.) You can call it genuine inspiration. Your mind continuously does this, we create and perceive in our mind simultaneously and our mind does this so well that we are not even aware that its happening.
Or generating inputs to query the memory with. I used this just yesterday on my Structured design test. I was stuck on a problem that asked the definition of a word (“_____ check”). I interrogated my mind by thinking “check…check…check…something check does…”
One result that activated was “desk checking”. Then I queried my mind for the meaning which resulted in the sentence meaning activating in my mind. But it didn’t match the definition of the word that I was looking. So I discarded it and started another query in which made apparent in the query to discard any anything that deals with “desk checking”.
But unfortunately i was unable to retrieve the answer. Why? I think its because I didn’t study that part of the test. But since it was a bonus question so its doesn’t hurt me anyway.
But this is how our mind works. I will get into that later.
Nor is there reasoning (comparing a concept with another concept)
Jan Bogaerts - Oct 12, 2011:
Genesis, When you say you are at 20% of phase 1, does that mean, coded or concept definition?
coded, I have about 305 lines of code. Which is good. There are 3 major parts of phase one.
1) Pattern Recognition (which consists of 40%)
2) Thinking (which consists of 20%)
3) and Reasoning (which consists of 40%)
I have only complete about 20% of pattern recog module. It can successfully map objects of letters and concepts of words in a controlled environment at the moment.
But there is a long way to go to get it where it should be. I will be working on the spatial and sequential part of the recognition all day today. I will also be programming three layers of the pattern recog. Depth is a necessity when you want to detect varieties of patterns.