Hello
I was just wondering why the contest doesn't come in different flavours: not only fooling a judge, but also holding a good conversation on any open theme.
Why not introduce specific topics or themes, such as helping a user with a specific need choose a product, or giving help or advice about some real thing, product or service, rather than wholly aimless chatter?
I say this because I think a truly humanlike AI that can fool a human is still far away in time, whereas useful chatterbots will help in many industry sectors and markets. The prize could help the industry drift in a useful direction.
I am sure that the goal of a conversation, and the quality of the bot in dealing with the whole mechanics of the conversation, matter more to any company than fooling the user into thinking he is talking to a human. The user should know it is a bot and even so feel comfortable with it. This may be a useful hint.
I have also seen and tested even the most daring, prize-winning bots, and they disappointed me a lot. Their conversation had no purpose other than to fool me and to steer my attention towards whatever the bot had more answers for. The conversations were all very stupid, of a factual question-and-answer type: no elaboration, no memory beyond a few words or topics, pre-built phrases repeated, no ability to follow a conversation thread, no… nothing human at all. I tested many of them, even the 'commercial-grade' ones, like the one made by virtuoz for Star Alliance. That one was very good, but it could not understand anything beyond a few keywords and pre-built phrases. I tested it a lot, and it kept giving the same nonsense answers to the same queries.
I would like to see bots with language generation capability, with the inventiveness to handle situations in the conversation such as a long silence, too much nonsense as input, repeated questions, insults, silly answers, and even stupid or grotesque words. A bot that could read like a human and see through text like: hhhhheeeeellloooooo, how are youuuuuu!! or detect that jnsdjkbcgsyughdks hfjd kkihd are not words at all!
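To make the idea concrete, here is a minimal sketch of the kind of input clean-up I mean. Everything in it is an assumption for illustration: the tiny vocabulary, the helper names (skeleton, build_index, looks_like_gibberish, normalize) and the thresholds are made up, the vowel test only makes sense for English-like text, and a real bot would need a proper lexicon and a better model of what is pronounceable.

```python
import re

VOWELS = set("aeiou")

def skeleton(token):
    """Lowercase, keep letters only, and collapse every run of a repeated letter."""
    letters = re.sub(r"[^a-z]", "", token.lower())
    return re.sub(r"(.)\1+", r"\1", letters)

def build_index(vocabulary):
    """Map each word's skeleton to the word (first word wins on collisions)."""
    index = {}
    for word in vocabulary:
        index.setdefault(skeleton(word), word)
    return index

def looks_like_gibberish(token):
    """Crude heuristic: too few vowels or a very long consonant run."""
    letters = re.sub(r"[^a-z]", "", token.lower())
    if not letters:
        return False
    vowel_ratio = sum(ch in VOWELS for ch in letters) / len(letters)
    longest_consonant_run = max(len(run) for run in re.split(r"[aeiou]+", letters))
    return vowel_ratio < 0.25 or longest_consonant_run >= 4

def normalize(text, index):
    """Return known words, None for gibberish, and unknown tokens lowercased."""
    result = []
    for token in text.split():
        key = skeleton(token)
        if key in index:
            result.append(index[key])
        elif looks_like_gibberish(token):
            result.append(None)
        else:
            result.append(token.lower())
    return result

index = build_index(["hello", "how", "are", "you"])
greeting = normalize("hhhhheeeeellloooooo, how are youuuuuu!!", index)
noise = normalize("jnsdjkbcgsyughdks hfjd kkihd", index)
```

With this toy vocabulary, the stretched greeting maps back to plain words and the three noise tokens are all flagged. Matching on skeletons is a deliberate simplification: words that differ only in doubled letters collide, so a real system would have to rank the candidates.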
Also, let me be blunt: English is a piece of cake when it comes to detecting patterns. Why not try a highly inflected language like Spanish, Polish, French, Italian or Portuguese?
Also, in my own experience AIML is not (at least not simply) capable of handling Spanish efficiently. I could not make it work decently at all, many years ago.
I am personally building all the pieces of complex stuff needed to handle even a small part of inflected Spanish, and it is a hell of a complexity. There is a real need for brilliant programmers to make a chatterbot speak Spanish, not just sit behind a stemmer like Snowball to simplify things (like some of the ones I've tested).
Some of the challenges I've faced:
- In English you can get a deep & rich ontology like WordNet, for free
- In Spanish there is none! :(
In Spanish there are more verbs and verb-phrase combinations than there are phrasal verbs in all of England and the US.
There are some scientific papers stating the number of inflections needed to understand Spanish, and the number is simply huge: over 9 zeros and counting!
Also, Spanish uses a great many prefixes and suffixes on all nouns and adjectives. I collected about 900 common prefixes and over 800 suffixes, many of which can also be infixed, so the combinations for each word are circa a million. (To be exact, not all words admit all prefixes and suffixes, but at least a third of them have a usable semantic meaning for building parasynthetic words.) So we may have more than 10k variations for each noun or adjective, and with 40k nouns in a typical Spanish dictionary, 100M noun variations may be a good, if conservative, estimate.
For verbs we have 70 inflections (tense, person, number, mood, gender), and for verbs the whole prefix list may be useful, because prefixes normally modify the semantic meaning of the verb. That gives 900x70 forms per verb, and with about 30k basic verbs we get at least 300M verbal forms.
The adverbs are about as numerous as the verbs, and you can build adjectives out of some verb inflections (derivations), so there is a huge and wild bunch of word forms out there to deal with.
This could also be a challenging thing to tackle, and to do something useful with.
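As a back-of-the-envelope check, the figures quoted above can simply be multiplied out. This is only a sketch: the variable names are mine, the inputs are the rough counts from the text, and the straight products are upper bounds, since not every prefix or suffix combines with every word. They come out larger than the rounded totals in the text, which only strengthens the point about scale.

```python
# Rough counts taken from the text above (estimates, not measured data).
prefixes = 900
suffixes = 800
nouns = 40_000
variations_per_noun = 10_000
verbs = 30_000
verb_inflections = 70

# Prefix x suffix pairs available to a single word ("circa a million").
affix_pairs = prefixes * suffixes

# Noun forms if every noun really had 10k usable variations.
noun_forms = nouns * variations_per_noun

# Verb forms if every prefix combined with every inflection of every verb.
forms_per_verb = prefixes * verb_inflections
verb_forms = verbs * forms_per_verb

print(f"affix pairs per word: {affix_pairs:,}")
print(f"noun forms:           {noun_forms:,}")
print(f"forms per verb:       {forms_per_verb:,}")
print(f"verb forms:           {verb_forms:,}")
```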
Even spelling corrections (mistyping) should be handled by a bot, because humans do this very well indeed!
I saw no bot capable of dealing with more than a one-letter mistake! (All of the bots missed the keyword when the mistake arose right there!)
What's more, why can't a bot manage word-similarity measures like humans do, or like Google, which with its huge, statistically rich source of human input comes up with an almost astonishing: did you mean xxxxx?
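A small sketch of what I have in mind: match a mistyped word against the bot's keywords by Levenshtein edit distance, tolerating more than one wrong letter on longer words. The function names, the keyword list and the max_ratio threshold are illustrative assumptions; this is not how Google or any existing bot does it, and a statistical approach fed with real user input would do much better.

```python
def edit_distance(a, b):
    """Levenshtein distance: insertions, deletions and substitutions cost 1 each."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]

def did_you_mean(word, keywords, max_ratio=0.34):
    """Return the closest keyword, or None if even the closest is too far away."""
    word = word.lower()
    best, best_distance = None, None
    for keyword in keywords:
        distance = edit_distance(word, keyword.lower())
        if best_distance is None or distance < best_distance:
            best, best_distance = keyword, distance
    if best is None:
        return None
    # Allow more mistakes on longer keywords, but always at least one.
    limit = max(1, int(len(best) * max_ratio))
    return best if best_distance <= limit else None

keywords = ["reservation", "baggage", "flight"]
print(did_you_mean("resevration", keywords))  # two letters swapped
print(did_you_mean("bagage", keywords))       # one letter missing
print(did_you_mean("xyzxyz", keywords))       # nothing close enough
```

Scaling the allowed distance with the keyword length is one simple design choice; a swap of two neighbouring letters counts as two edits here, so a variant that treats transpositions as a single edit would be more forgiving of typical typing slips.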
Dr Loebner:
Those are questions and things to add to enhance a challenging contest like the Loebner Prize, and the outcome should be many scores, like
- best spell checker and bad-writing interpreter
- best turn-taking manager bot (happiest human experience)
- best 'intelligent' bot at understanding and elegantly defeating silly inputs
- best and smartest goal-following bot
- best helpful hinter bot (for a product hint or advice)
- best service technician/advisor bot (for technical support)
- best 'seller' bot (for a smart-selling bot making offers)
- best humorous bot, for a bot that can take any input and create real jokes (not pre-written)
etc.
and each of those categories should be available in any language.
I believe that at first there will be no participants in many languages, but over time there will be statistics on how much is being done in each language, and this will give an impulse to other bot engines in other languages.
This is just a proposal to enhance the challenge.
best regards
Andres H