BeagleBoard/GSoC/Modern Speak and Spell
- 1 ProposalTemplate
- 2 Status
- 3 Proposal
- 3.1 About Me
- 3.2 Modern “Speak & Spell” using PocketBeagle
- 3.2.1 Description
- 3.2.2 Modern “Speak & Spell” project overview
- 3.2.3 Detailed Description
- 3.2.4 Progress till now
- 3.2.5 Timeline:
- 3.3 Final Goals:
- 3.4 Experience
- 3.5 Benefit
Recreating and improving the functionality of the previous Speak and Spell toy by Texas Instruments, and generate open-source code for it.
Student: Anirban Banik
Mentors: Jason Kridner
To celebrate the 40th anniversary of the “Speak & Spell” from Texas instruments, this proposal was accepted in order to create an updated “Speak & Spell” using a PocketBeagle.
I have completed the task required as described on the ideas page, and created a pull request, as listed here
IRC: AnirbanBanik1998 || Anirban
School: ST. Xavier's Institution, Panihati, Kolkata
Primary language: English,Hindi,Bengali
Typical work hours: 9:30 - 23:00 IST GMT/EST/PST to Ist Adjusted Time
Previous GSoC participation: This is my first GSoC participation. Got interested in open-source the day I started with it, and naturally learnt about GSoC soon. Really excited to work with an open-source community, and generate useful open-source code.
Tools(proficient) : Git,Linux
Hardware Skills : Arduino, Raspberry Pi(basic)
Modern “Speak & Spell” using PocketBeagle
Project name: Modern “Speak & Spell” using PocketBeagle
To celebrate the 40th anniversary of the “Speak & Spell” from Texas instruments, create an updated “Speak & Spell” using a PocketBeagle. This should be a general Linux application that can be reproduced and should not be a one-off build.
My approach to this problem will be more in the software domain, and some basic wirings in the hardware domain.
Modern “Speak & Spell” project overview
The Speak & Spell was an electronic hand-held computer first introduced in 1978, and instantly becoming a favourite of children and adults alike. that consisted of a TMC0280 linear predictive coding speech synthesizer, a keyboard, and a receptor slot to receive one of a collection of ROM game. The improvements that I am trying to bring forth in this project are, Speech to Text along with Text to Speech functionality. For the former, I am thinking of using CMU Sphinx, and for the latter I will use CMU Flite which is specifically designed for small-scale embedded systems. I am aiming at implementing speech-recognition for not only recognizing the spelling of the user, but also as a voice launcher for launching the games at the user's command. This can be specifically helpful for those who don't have a keyboard or don't want to use it. This is, of course, an added functionality, as the typing functionality will always be there.
Basically, there are four games to be implemented on this device, "Spell It", "Hangman", "Encrypter"and "Crossword". On Starting the Application:
1. The user will be prompted to select a game among the given list of games. He can either launch the game using his voice or keyboard. Thus the voice-launcher scheme works here.
2. If the "Spell It" game launches, it will prompt the user to speak out the spelling of a word spoken out by the application. The word is from an already created dictionary. Here Text to Speech works.
3. If the "Hangman" game launches, it will ask the user to fill in the blanks to create a word. This game is so made that if any letter is entered or spoken by the user, which is repeated in the answer, then the letter is placed at every blank it is supposed to be in one go.
4. If the "Encrypter" game starts, then it is a multi-user game. One encrypts a word in a certain way, and the other has to decrypt it to get back the original word.
5. In the "Crossword" game, the user has only to specify where he is going to work, i.e. which row or column, then proceed as in the "Hangman" game.
A rough diagram of the "Crossword" game.
Progress till now
I have started working on this project in order to brush up my understanding about how this application is going to work. Firstly, I started with recreating the original games, so that I might be able to add more functionality to them later. Finished building two games, the "Spell It" and "Hangman" and have started work on "Encrypter". Regarding the usage of CMU Sphinx, till now I have restricted myself to only the English Speak and Spell which I aim to extend to the other languages as well. The usage of CMU Sphinx requires a dictionary of words to be recognized be made. I have done it manually, and made a phonetic dictionary of it from the CMU Sphinx website. This dictionary contains words to start and stop games, and alphabets A to Z for aid in Spelling.
A rough functioning of the Hangman Game.
Started with implementation of Speech to text. Gradual progress is being made in it, adding to my experience as well. We have to work in two terminals simultaneously, one for launching the PocketSphinx, and another for the actual games.
I don't have any commitments during the period of GSoC, therefore I can devote 50+ hours per week for this project.
I have already started work on this project and will try to increase the reach of this project to other languages as well as develop the project on beaglebone once selected, reading as much documentation i can get on my hands about beaglebone in the process.
Community Bonding Period
Get acquainted with the documentation of beaglebone, improving on my previous code, and discussing about any further improvements in depth with the mentors.
Week 1 - 2
Working on a web-crawler to crawl the Wikitionary to get the vocabulary of words of various languages. Preparing the introductory presentation slides and video.
Working on building the "Spell It" game, for multiple languages.
Week 4- 5
Rebuilding the "Hangman" game, deployment on pocketbeagle, in multiple languages.
Week 6 - 8
Building the "Encrypter" game, adding multi-user functionality. Adding the functionality to encrypt words based on the user's discretion, including new types of encryptions, and creating a database to use them later.
Week 9 - 10
Building the "Crossword" game, as an extension of the "Hangman" game, and generating scores based on how much of the puzzle is filled up.
Testing, fixing bugs, merging, improving documentation.
Prepare the final presentation slides and video.
A Modern Speak and Spell using PocketBeagle, all of whose parts are reproducible, and generating open-source code for it, for utilizing it in a more widespread way.
Though I had Computer Science as a subject in my school, yet was never really aware of all the things going on in this domain, till I reached college. Was fascinated by the progress occurring in this field. I was particularly impressed by the freedom one gets in Open-Source. I would like to contribute to this community, not only because of my interest in hardware in general, but because this project as a whole appealed to me.
Previous Contributions to Open-Source
- Maintained a repository on Socket-Programming in Python for HacktoberFest. https://github.com/lugnitdgp/Socket-Programming
- Made a multi-client Chat-Server on a single host computer in Node.js as a learning process. https://github.com/AnirbanBanik1998/Chat-Server
- Made a Cleanup application to cleanup memory space filled up by unused apps. https://github.com/AnirbanBanik1998/Cleanup
- As a part of GSoC-Heat, contributed to MemeFinder, a search engine to store memes scraped from Reddit.com https://github.com/NITDgpOS/MemeFinder/commit/bed08441e8a75b0e6aa6afa12ce42ef10dcbe891 https://github.com/NITDgpOS/MemeFinder/commit/04c4ff1c3a407327be7f1918052c8626cf0eea2e
Contributions to this project
As there was no upstream repository to contribute to, I started recreating the games myself, and worked on adding Speech to Text and vice-versa functionalities. https://github.com/AnirbanBanik1998/Speak_and_Spell
- Participated in GSoC Heat contest in our college, and got selected for a mini experience of the actual GSoC. Completed the tasks given in my proposal successfully.
During entire period of GSoC I won't be having any academic duties to fulfill since the summer break would be pretty long. Thus i can devote my whole time to the project. This will in-turn help me in increasing my experience building projects on open-source hardware, and open-source in general.
I believe that I should be chosen for this project because of my love for tinkering around with hardware and interest in open-source contributions as a whole. I would enjoy working on this project and pour in all of the hardwork and passion required.
This device has always been aimed at pre-schoolers, and they will surely find it intriguing as well as educative. The new added functionalities will provide for more robust usage of this device.
What community members speak -
A voice recognition SnS would be kinda cool. Jason Kridner(jkridner[m])
This could help with learning english, but could also be extended to other languages. ie have a chinese speak-and-spell, french speak-and-spell, etc. Erik Welsh(erik.welsh)
I will be active in the community, contributing to more open-source projects, gaining more valuable experience, and help newcomers get acquainted with beaglebone in general, the way I was helped when I first asked out in the group.