Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot
In human speech, the timing function is important for determining its duration, stress and rhythm; however, little attention has been paid to these issues when building a speech synthesis system. In the human brain, the cerebellum plays a key role in the coordination, precision and timing of motor r...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Taylor & Francis Group
2018-10-01
|
| Series: | Connection Science |
| Subjects: | |
| Online Access: | http://dx.doi.org/10.1080/09540091.2018.1510901 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849700402668240896 |
|---|---|
| author | Vo Nhu Thanh Hideyuki Sawada |
| author_facet | Vo Nhu Thanh Hideyuki Sawada |
| author_sort | Vo Nhu Thanh |
| collection | DOAJ |
| description | In human speech, the timing function is important for determining its duration, stress and rhythm; however, little attention has been paid to these issues when building a speech synthesis system. In the human brain, the cerebellum plays a key role in the coordination, precision and timing of motor responses. We have developed a talking robot, which generates human-like vocal sounds using a simplified cerebellum-like neural network model as the timing function. The model was designed using the System Generator software in Matlab environment and the timing duration of trained speech was estimated using hardware co-simulated with a field programmable gate array board (FPGA). The timing information obtained from the co-simulation, together with the output motor vector, is sent to the talking robot controller in order to generate vowels of short, medium and long duration. Using this model for short-range timing of less than 1200 milliseconds, we verify that the short-range learning capability of the cerebellar-like neural network is applicable to the speaking robot for generating a human-like speech with prosodic features. |
| format | Article |
| id | doaj-art-6c6caebca16d4646a5b43f3c1dd798c2 |
| institution | DOAJ |
| issn | 0954-0091 1360-0494 |
| language | English |
| publishDate | 2018-10-01 |
| publisher | Taylor & Francis Group |
| record_format | Article |
| series | Connection Science |
| spelling | doaj-art-6c6caebca16d4646a5b43f3c1dd798c22025-08-20T03:18:17ZengTaylor & Francis GroupConnection Science0954-00911360-04942018-10-0130438840810.1080/09540091.2018.15109011510901Simplified cerebellum-like spiking neural network as short-range timing function for the talking robotVo Nhu Thanh0Hideyuki Sawada1The University of Danang, University of Science and TechnologyWaseda UniversityIn human speech, the timing function is important for determining its duration, stress and rhythm; however, little attention has been paid to these issues when building a speech synthesis system. In the human brain, the cerebellum plays a key role in the coordination, precision and timing of motor responses. We have developed a talking robot, which generates human-like vocal sounds using a simplified cerebellum-like neural network model as the timing function. The model was designed using the System Generator software in Matlab environment and the timing duration of trained speech was estimated using hardware co-simulated with a field programmable gate array board (FPGA). The timing information obtained from the co-simulation, together with the output motor vector, is sent to the talking robot controller in order to generate vowels of short, medium and long duration. Using this model for short-range timing of less than 1200 milliseconds, we verify that the short-range learning capability of the cerebellar-like neural network is applicable to the speaking robot for generating a human-like speech with prosodic features.http://dx.doi.org/10.1080/09540091.2018.1510901cerebellumneural networktiming functiontalking robotfpga |
| spellingShingle | Vo Nhu Thanh Hideyuki Sawada Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot Connection Science cerebellum neural network timing function talking robot fpga |
| title | Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot |
| title_full | Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot |
| title_fullStr | Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot |
| title_full_unstemmed | Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot |
| title_short | Simplified cerebellum-like spiking neural network as short-range timing function for the talking robot |
| title_sort | simplified cerebellum like spiking neural network as short range timing function for the talking robot |
| topic | cerebellum neural network timing function talking robot fpga |
| url | http://dx.doi.org/10.1080/09540091.2018.1510901 |
| work_keys_str_mv | AT vonhuthanh simplifiedcerebellumlikespikingneuralnetworkasshortrangetimingfunctionforthetalkingrobot AT hideyukisawada simplifiedcerebellumlikespikingneuralnetworkasshortrangetimingfunctionforthetalkingrobot |