contact us | support Technology to Bridge the Language Gap
Products
| MediaSphere |
|
|
|
|
|
Automated Media Monitoring AppTek’s MediaSphere provides media monitoring of broadcast and telephony speech for commercial and government customers. The system monitors a variety of media sources and provides a turnkey solution that uses AppTek’s speaker-adaptive automatic speech recognition engine and hybrid machine translation system. MediaSphere offer industry-leading speed and accuracy in media monitoring.Additional engines for non-broadcast sources, and increased coverage of dialects for automated speech recognition provide a unified and scalable solution that offers text processing with speech-to-text transcription, machine translation of transcribed text, information retrieval with query translation, and automated name-entity detection. MediaSphere identifies keywords and phrases in TV, radio, video, audio, and telephony providing multilingual transcripts from broadcast and conversation telephony sources. In addition, the system provides translation of the transcribed text and delivery of rich media content online.This advanced linguistic tool also provides video and audio logging technologies and Audio Mining.Components1. Automatic Speech Recognition (ASR) and Audio Mining AppTek provides ASR for Modern Standard Arabic (MSA) that is speaker independent. The system supports MSA but is designed to collect, transcribe, and train using selected dialects. The system can support multiple local dialects and improves the accuracy of performance.Spoken Language: the Natural Dialect
2. Knowledge Management System In the area of Knowledge Management, AppTek offers several software modules under the collective name PlainKnowledge, which have already been integrated into third party products or have also been implemented as SDK for end customers. The individual modules are PlainCluster (grouping), PlainClassify (classification), PlainSummarize (summarization), PlainExtract (content-specific extraction), PlainLingua (language recognition of a text), and PlainRetrieve (associative search). 3. Machine Translation (MT) Technology AppTek provides the Machine Translation (MT) application for more than twenty (20) different language pairs. The system can operate on its own or can be used as a dynamic, Web-based translation tool for online content. It is fully integrated with Automatic Speech Recognition and Text-to-Speech software, which enables future development of speech-to-speech Machine Translation. The suite of linguistic tools includes proper-noun recognition, morphological analysis, and large, annotated lexicons. 4. Digital Assets Management (DAM) The Digital Assets Management system is comprised of multiple agents that provide access to various resources. These resources are collected, stored, and retrieved via the Digital Asset Management Network. The network refers to the interconnection and collaboration of the various agents; it is not a computer-based network. The entire network can be contained within a single computer, but this may not be practical given the resources required to perform real-time analysis. In some cases, agents may exist on the same computer to provide less-latent access to published data. AppTek’s human language technology and DAM have been integrated in the UIMA environment, an open architecture for the management of unstructured information such as natural language text (e.g., news articles), audio, and video. This management includes analysis, search, storage, information retrieval and extraction, data mining, etc. UIMA analyzes unstructured information in order to produce structured databases or indexes. 5. NameFinder NameFinder™ is an advanced technology engine that scans text for proper nouns (such as human names, organizations, geographic locations, currencies, parts, etc.) in various languages, and even recognizes proper nouns in writing systems that do not use capitalization. NameFinder™ can also accurately identify the ethno-linguistic origins of a person’s name. The system recognizes and transliterates between the Roman (Latin) alphabet and other writing systems, and can identify transliterated names despite discrepancies, ambiguities, or simple misspellings. FunctionalityTelevision signal reception and distribution and Telephony Audio Monitoring capability This solution provides 24x7 monitoring of a foreign language channel, English-speaking channel, or telephony platform. The monitoring capabilities include:
Review streaming video with synchronized transcriptions and translations Review Streaming Video: The system automatically builds a rich, structured index in real time that can be used to search video content and immediately locate a specific video segment for playback. The video index is time synchronized to every encoded copy providing immediate and exact retrieval of the content. Words and phrases are automatically highlighted within a transcript, allowing users to monitor the Arabic (source) speech-to-text recognition and English translation synchronized with the video. Review Video Segments: The Video Digitizing process transforms standard analog or digital video into Web-friendly and broadcast-quality content. Video Digitizing manages multiple digitizing processes while simultaneously indexing video. VideoCapture automatically detects visual scene changes, spoken words, and recognized voices. The resulting rich video index provides both editors and viewers with fine-grained control over content. The video index allows users to select video segments based on channel, start time, video frame, or between live (with 10-minute latency) or archived videos. Review Still Frames: VideoCapture uses signal analysis algorithms to generate key frames, which provide a visual overview of the content. Search for specific spoken content in the video cache by Arabic or English-language text query Full-text search in Arabic or English: Using Advanced Search, users can narrow searches to include a specific broadcaster, program, language, asset type, date or range of dates, Boolean operators, or keywords. Search queries can be saved and used again and again. The search capability supports all languages managed by the system. The linguistic-based search integrated with the system (TextFinder™) provides the following advanced capabilities:
Reference lists and real-time alerts Alerts from keyword-based filters: MediaSphere monitors news broadcasts and alerts users of new content that is of interest to them. All incoming information is matched against users’ saved queries, and alerts are triggered when the specified criteria in the saved queries matches the incoming information above a specified threshold. The use of contextual profiling and personalization ensures that subtle differences in context are taken into account before alerting a user of new information. It is crucial that any system that alerts a user of new content does so only when the content is relevant to the user and the system allows for an arbitrary level of customization to take place. Embedded proper noun identification is open to user-specific keyword list creation, modification, or addition. Provide graphical temporal display of the occurrence of topics or keywords within one or all channels The Video Digitizing technology includes the following features:
Other Functionality
|



