Saturday, March 30, 2019

Advanced technology in speech-based interfaces

Advanced applied science in lecturing-establish interfacesAbstractSpeech- natesd interfaces be non modern to computing, they surrender been tellingly under apply as an efficient and effective method of merciful beings and computer interaction. The applied science has been of cracking interest over the past hardly a(prenominal) years, although thither argon still signifi basist improvements and possibilities for the prox. This paper investigates current usages and modulars of the technology and what contri unlessions are being made. The paper to a fault identifies close to accomplishable hereafter wonts of Speech- ground interfaces, and possible future benefits of this technology, when compared to current methods and certain types of wontrs.Speech-based interfaces are non immature to computing, they hand over been relatively underused as an efficient and effective method of man and computer interaction. A background to the technology is included and it is desc ribed how the subscribe to for natural expression and pitch interfaces increased, and there became a need for regularisation, and the standard VoiceXML was released. From this standard other technologies were born, including a combination of XHTML and VoiceXML to develop cyberspace activitys with a speech-based interface. These technologies combine with web and car technologies fill provided an opportunity for illustration restrainer motor vehicle control in the near future. While this technology has been designed to help the average person be much efficient, with any(prenominal) dainty turns there outhouse be benefits to be gained from gray drug users and incapacitate users as well. With e actually new technology there exist problems which exit be discussed as well, and this will lead to a conclusion summarising points and justifying the benefits. instinctive language interfaces are an essential part of Human Computer Interaction, as the number of thinks in the world still outnumbers of computers and therefore natural language is more widely used than a mouse or keyboard. To quiet the progress of exchanges between humans and machines the World large-minded entanglement crime syndicate (W3C) has published a recommendation for vocal interaction language based on XML, which allows interactions on m whatever interfaces including Internet applications by using XHTML combined with VoiceXML. Because VoiceXML uses the HTTP protocol to transport it is possible for a VoiceXML teleph single gateway to communicate with a web horde, in this type of environment the web server is providing a repartee to a user on a head scream and bridging the gap between phone and Internet. This is supported by the World unsubtle Web Consortium (2010)The telephone was invented more than 150 years ago, and continues to be a very important office for us to communicate with to each one other. The Web by comparison is very recent, but has rapidly pay off a comp eting communications channel. The convergence of telecommunications and the Web is now bringing the benefits of Web technology to the telephone, enabling Web developers to create applications that ignore be get toed via whatever telephone, and allowing slew to interact with these applications via speech and telephone keypads (p. 1).VoiceXML is becoming a standard for Human-Computer speech sound, with speech synthesis and recognition of verbalize input. This technology brings the ability to have a natural conversation as an Internet and content role interface. An automated phone system with VoiceXML also has the ability to register or translate multiple languages. The popularity is increasing as major companies such as IBM, HP and Motorola are now supporting and using VoiceXML. A major goal is to bring the advantages of web-based using and content delivery to interactive component response applications (Rouillard, 2007, p. 27).XHTML + Voice (X+V) are a technology for desc ribing visual and audio web pages, visual interaction is described by XHTML and auditory interaction is described by VoiceXML. Enabling users to have a HTML give away of a website, with the ability to navigate and use the site by persona or by traditional methods of input. Until recently XHTML and VoiceXML (X+V) occasionality had not been implemented by major Internet browser companies, instead it had been used by small companies with government grants and been talked about as a possible future technology. presently the opera house web browser offers native support for XHTML and VoiceXML, it will also attempt voice interaction with standard XHTML pages. While Internet adventurer and Firefox still do not have native support for XHTML and VoiceXML, although tierce party extensions and add-ons have been created. Opera Software ASA say, any ordinary browser command basis be done by voice, such as navigating to, and following the close link in a document, going to the next slide in an Opera Show presentation, or logging on to a password protected Website (p. 1). XHTML and VoiceXML offer an increased opportunity with Opera web browser now being installed in Ford vehicles, for a speech-based interface to en fitted eye-free and hands-free computer interaction while driving. This technology could potentially control dash-panel and computer systems via speech-based interfaces, alter users melt downality from changing the temperature of the heater to sending emails by voice while driving a car. Opera Software ASA say, This event will allow Ford truck and van owners to maintain a virtual work environment with vex to all of the important files, tuition and applications they need on a daily basis (p. 1).Because XML is a energizing and universal language overseen by the W3C, it means that XML based technologies such as VoiceXML are not limited to Internet applications. The same piece of XML can be used for various applications and imported into other applicati ons if they support it, and there is no reason why VoiceXML cannot be the same in the future as well. Mobile phones for some time have had the ability to enjoin text nitty-grittys and email messages aloud to the user, which could be beneficial for visually stricken persons and persons operating a vehicle. Text-to-speech software reads the text on the screen aloud in a natural sounding voice, giving you convenient access to phone menus and functions, short messages, e-mail messages (Nokia, n.d., p. 1). Using VoiceXML based technology it is completely possible for a user to read a text message aloud to the mobile phone, the phone translate this to textual content and sends it via the SMS service. This may sound silly at first, due to the technology to be able to call someone and say it verbally without a computer translating the oral communication into text for you. Although this would give businesses a greater ability to stay in contact while on the move, as text messaging is used extensively in business and pick outred in some cases depending on the message being sent. This could also provide a solution to a major problem with cellular phones, which is texting while driving. In principle a technology that allows a user to drive and sent text messages safely while talking to their cell phone will save lives and set up lives easier. Talking to a passenger or singing to the radio has not been noted as a significant cause of crashes, which are very similar functions to verbalising a text message. Government officials arent the only ones getting on the texting ban-wagon. TV talk show host Oprah Winfrey has launched a national telecasting and Internet campaign to encourage people to commit to putting their cell phones away while driving (Hattiesburg American, 2010, p. 1). As technology has progressed, people have continuously sought after smaller and smaller bends with greater spot and speed. Technology has reached the point where the input devices them selves are holding back the device from becoming any smaller. Voice interaction can escape the personal limitations on keypads and displays as mobile devices snuff it ever smaller (World Wide Web Consortium, 2010, p. 5).With a global aging population it is important that we enable and help elderly people to function and live as independently as technology will allow. Elderly people may be able to benefit by the advancement of speech-based technologies, but to first substantiate how they could benefit, it is important to understand their characteristics. The human interfaces to most computer systems for general use have been designed, either deliberately or by default, for a typical, younger user (Gregor, P., Newell, A. F., 2001, p. 1). Elderly people can be crudely generalise into three groups fit sr. people, frail older people and older people with long term disabilities. Fit older people can be described as those who appear or do not consider themselves modify. Frail olde r people who would be considered as disenable and have one or more difficulties, including at least one that impairs their functionality in some way. The elderly who have had a long-term constipation throughout their life that has affected the aging process and their ability to function is dependent on declining functions. Other aspects to keep into consideration are the discrepancy in physical, sensory and cognitive abilities with the elderly, as one size does not fit all in this situation. Another aspect is the variations in ability to operate a computer system due to disabilities, impairments and learning capabilities. Gregor and Newell (2001) endIn general, as people grow older their abilities change. This process of change includes a decline over time in the cognitive, physical and sensory functions, and each of these will decline at diametric rates relative to one another for each individual. This pattern of capabilities varies widely between individuals, and as people gr ow older, this variability increases. In addition, any given individuals capabilities set off in the short term due, for example, to temporary decrease in, or overtaking of, function due to a variety of causes including illness, blood sugar levels and land of arousal (p. 2). Interfaces for older people need to have a greater diversity of functionality when compared to a younger group, to meet the greater needs. By providing a speech based interface as an option for operating a computer, it is dependent on a function that most people have used their entire lives and is reliant on a function that is not considered to dramatically decrease with age. This can also enable them to use a computer system with a telephone as described antecedently with VoiceXML capabilities, for those who are intimidated by technology and the thought of using a computer. Finally the interface designed needs to use general harm over proficient terms, for example moving to the main section instead than c licking on the home link.Most systems and interfaces are designed for typical good for you(p) or high functioning users, when compared with users with disabilities that can have difficulties using a standard keyboard or mouse. It is important with the growth of the Internet and technology that disabled users are not left out, and that they are able to access these resources if they choose, or if it could benefit their lives. There may be situations where a computer application could benefit the life of somebody with a handicap, but they cannot use a computer due to motor-function restrictions. This demonstrates the need for hands-free or eye-free computer access and includes 2 main groups, visually impaired users and motor-handicapped. The Web Accessibility Initiative (WAI) whole kit and caboodle with organizations around the world to develop strategies, guidelines, and resources to help make the Web convenient to people with disabilities (Web Accessibility Initiative, 2009, p. 1). Many applications and web browsers are developed to take to heart people with disabilities, although many of them have been quietly withdrawn leaving depressed links or on the occasion that the system is still addressable for download it may have been abandoned and not maintained anymore. An important aspect of developing voice applications for handicapped users is that they may want to use voice control in combination with other interfaces such as a joystick or other aid devices. The aim of speech systems is generally naturalness and to copy conversations that we have had our entire lives, but in the case of users with disabilities it may be more beneficial to aim for learn-ability over naturalness. For example instead of saying activate microphone or something technical to activate the microphone, saying Wake Up un-mutes the microphone and turns on the neat in left side (Brondsted Aaskoven, 2005, p.4). Technology is currently heading toward eye-free and hands-free access of systems, for purposes such as accessing a computer while driving a car or making us more productive. The same base technology is required to support speech based services for disabled users, but the difference of needs when interacting are very different. We generally would prefer to speak to a computer in a turn based communication alike(p) we have when we are talking to other human beings, although as an aid for using systems or interface for disabled users it would be more beneficial to use command driven voice systems using non-technical terms. While still using human to human terms, such as wake-up and sleep which even severely mentally disabled users would understand. There are people with mental disabilities so severe that they are unable to understand wake-up or sleep, but they are highly unlikely to have any need for a computer, as they are more concerned with living day to day.The VoiceXML standard has ensured a guideline for developing voice applications, but there ar e currently no standards for the development environments or interfaces. This means that the layout and functionality from development environments will be completely different, and the code generated by the development environments will not necessarily be compatible, as the two different development environments will generate completely different tags and formats. Building spoken applications from scratch can take a long period of time, and several(prenominal) different exemplars and technologies. As VoiceXML works with predetermined grammar, which can be troublesome in the development of some applications. But by combining the VoiceXML course of study with independent systems for voice recognition, it is possible to increase its capacities of understanding. VoiceXML is great step toward speech and voice based interfaces, but it has a lot of work to become a complete framework for developing speech applications. Accordingly, a great deal of emphasis has been placed on the develop ment of toolkits and environments that hide some of this complexity and allow developers to rapidly prototype and deploy speech-based applications. (Bennett Llitjod Shriver Rudnicky Black, 2002, p. 1). Natural speech-based interfaces can provide a known and familiar interface for interacting with computer systems, because we drop down our lives conversing with other people and communicating over the telephone. Current technology makes it possible to interact with a website or computer application via a telephone and it is possible to translate the language spoken for the system, and translate a response back to the user. The ability to use a generic markup language like VoiceXML with applications such as XHTML is a leap forward in creating an Internet that can be accessible via speech-based interfaces. This enables future technology such as voice controlled functions of a motor vehicle and improved cell phone speech interface. One of the most significant impacts of this technol ogy is the ability for elderly people to use a function is not known for regress as a computing interface. This will also enable users who are new to computers but familiar with telephones to use a computer more easily. Many disabled people struggle to maintain their independence, with motor function restrictions that prevents them from using a computer effectively. With the ability for disabled people to misrepresent programs and browse the Internet with a speech interface, it could help them maintain their freedom and independence. As with all new technologies, there are severe problems that a solution must be found for before this technology can take off this includes a standard for a complete framework rather than just a markup language providing grammar and large vocabulary support. It is conclude that speech-based interfaces currently, and will continue to, provide benefits in the advancement of the technology, providing that the right people get access to this technology a nd not just the average user who is happy to type.ReferencesBennett, C., Llitjod, A. F., Shriver, S., Rudnicky, A., Black, A.W. (2002). Building voicexml-based applications. Paper presented at the7th International Conference on Spoken Language Processing September 2002, Denver, Colorado, United States of America. Retrieved February 19, 2010, from http//www.cs.cmu.edu/awb/papers/ICSLP2002/voicexml.pdfBrondsted, T., Aaskoven, E. (2005). Voice-controlled internet browse for motor-handicapped users. Design and Implementation Issues, Interspeech 2005. doi10.1.1.65.3974Gregor, P., Newell, A. F. (2001). Designing for Dynamic variety Making accessible interfaces for older people. In J. Jorge., R. Heller., R. Guedj (Eds.). Proceedings of 2001 EC/NSF shop on Universal Accessibility of Ubiquitous Computing Providing for the Elderly 22-25 whitethorn 2001, Alcacer do Sal, Portugal. Dunhee University of Dunhee.Hattiesburg American. (2010). Texting while driving deadly at any age. Retrieve d borderland 1, from 2010 from http//www.hattiesburgamerican.com/article/20100221/OPINION01/2210304/Texting-while-driving-deadly-at-any-ageOpera Software ASA. (2010). Opera Tutorials. Retrieved expose 1, 2010 from http//www.opera.com/browser/tutorials/voice/using/Opera Software ASA. (2009). Opera brings full web browsing to new ford trucks and vans. Retrieved March 3, 2010 from http//www.opera.com/press/releases/2009/04/02_2/Nokia. (n.d.). Nokia accessibility Text to speech. Retrieved March 1, 2010 from http//www.nokiaaccessibility.com/tts.htmlRouillard, J. (2007) Web services and speech-based applications around voicexml. Journal of Networks, 2(1), 27-35.Web Accessibility Initiative. (2009). near WAI. Retrieved March 1, 2010 from http//www.w3.org/WAI/about-links.htmlWorld Wide Web Consortium. (2010). W3C voice browser working(a) group. Retrieved March 1, 2010 from http//www.w3.org/Voice/

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.