89 resultados
¿Por qué es gratis Capterra?
El software de reconocimiento de voz líder del sector, utilizado por médicos, abogados y otros profesionales para convertir un discurso en texto. A partir de los 119.99 USD para la edición Premium, Dragon ha sido utilizado por miles de profesionales para el dictado y la transcripción durante más de 30 años. Se ejecuta en plataformas de Windows y Mac. Convierte el discurso en texto al realizar un dictado en aplicaciones basadas en Windows a velocidades de hasta 160 palabras por minuto.
Sistema de cómputo técnico que proporciona herramientas para el procesamiento de imágenes, geometría, visualización, aprendizaje de máquinas, minería de datos y mucho más. Sistema de cómputo técnico que proporciona herramientas para el procesamiento de imágenes, geometría, visualización, aprendizaje de máquinas, minería de datos y mucho más.
No es un servicio de transcripción típico. Sonix es una plataforma en línea. Sube un archivo a Sonix y en menos tiempo que la duración de la grabación, recibirás un correo electrónico notificándote que tu transcripción ha finalizado. El correo electrónico incluirá un enlace a la transcripción. La transcripción incluye marcas de tiempo, resaltado y funcionalidad de edición integrada en la transcripción. Se puede exportar a muchos formatos para usar en producciones o redes sociales. Transcribir y editar audio y video es difícil. Sonix lo hace rápido, simple y asequible.
Solución de centro de contactos multicanal basada en la nube que presta servicios a más de 1000 clientes en 20 mercados verticales. Solución de centro de contactos multicanal basada en la nube que presta servicios a más de 1000 clientes en 20 mercados verticales.
Multi-language speech recognition software with the ability to dictate in any third party software or to fill forms on websites. Apart from dictation, Braina also provides voice command features that allows you to search the web, open file, programs & websites, find information, set reminders, take notes and much more. You can use your voice to dictate text to your Windows computer, automate processes and improve your personal and business productivity. Multi-language speech recognition software with the ability to dictate in any third party software or to fill forms on websites.
Talkatoo is a speech-to-text software. Talkatoo has been built specifically for veterinarians and has a built-in vet vocabulary. Talkatoo is a subscription-based software and starts at $55.96/month. There is no commitment and no additional fees or hardware. Talkatoo understands accents and does not require a training period. Complete your medical records in half the time. Talkatoo works in any field, dictate in all practice management software, MS Word, Google Docs, email, etc. The speech-to-text software for veterinary professionals. Processes up to five times the average typing speed. Works everywhere.
Una solución de reconocimiento y conversión de voz, en varios idiomas, documentos y transcriptor de correos electrónicos y más. Una solución de reconocimiento y conversión de voz, en varios idiomas, documentos y transcriptor de correos electrónicos y más.
CallFinder is a leading provider of SaaS speech analytics software, automated call scoring, and speech-to-text transcription technology with conversational insights, such as sentiment analysis. Our easy-to-use solution is designed to help small and medium-sized businesses and contact centers automate quality monitoring to improve agent performance and provide a superior customer experience. All CallFinder clients are supported by our unparalleled MyAnalyst managed client support service. CallFinder® is a leading provider of SaaS speech analytics, automated call scoring, and speech-to-text transcription technology.
Through technology, insight and experience, BigHand delivers success for the future by helping its clients achieve professional productivity and operational excellence. The leading software technology company has developed a range of solutions from task delegation, document creation, matter pricing, digital dictation workflow, intuitive reporting and analytics, that help busy people achieve more in less time and organizations become more efficient and effective. BigHand offers speech, workflow, document creation, process improvement, matter pricing and BI solutions for law firms of all sizes.
Allows physicians to produce more accurate reports using dictation and speech recognition technology. Allows physicians to produce more accurate reports using dictation and speech recognition technology.
Go Transcribe provides the latest software invention to convert speech in to text which will save you time, money and effort. Simply upload your files onto our platform using any device and your file will be converted in a matter of minutes. The transcription can be viewed on our unique online editor. You can playback the original file and jump to specific parts of the audio and make amendments to the transcription where required. Your transcription can be downloaded to several popular formats. Cloud based transcription service powered by artificial intelligence. Automatically converts audio/video files into text
Mobile app that recognizes speech by sound or text and can translate from web pages, communications, and more. Mobile app that recognizes speech by sound or text and can translate from web pages, communications, and more.
NexGen Mobile Solutions (formerly Entrada) cloud-based engagement platform for healthcare providers streamlines workflows & reduces physician burnout. Providers can view their clinical schedule and EHR patient data from their mobile device and dictate patient encounters anytime, anywhere that populate inside the EHR. They can also communicate with their care team through secure text messaging. Available on Android and iOS platforms for physician groups of all specialties and sizes. NexGen Mobile Solutions (formerly Entrada) solves physician burnout by improving EHR workflows through its speech-driven documentation.
Reason8 is an AI-powered service for automatic note taking and preparation of summaries for in-person business and scrum meetings. We provide the best note taking quality on the market because we use multiple smartphones and AI patent pending approach to boost quality of speaker separation and drafting meeting summaries. We are actively working on advanced summarization, collaboration features for teamwork, and integrations with project management services and communication tools. AI-powered service for automatic note taking and preparation of summaries for in-person business and scrum meetings
Zubtitle is an online video editing tool that leverages A.I. and speech-to-text software to automatically add captions/subtitles to any video. Zubtitle also provides video editing tools tailored to social videos. Quickly resize videos for any social platform, add video headlines, custom styling, and more. Zubtitle gets videos ready for social media in minutes. Automatically add captions & headlines effortlessly, plus resize your video.
Speech recognition software for hospitals and medical practices. Allows to dictate notes straight into a Windows-based EMR. Speech recognition software for hospitals and medical practices. Allows to dictate notes straight into a Windows-based EMR.
SmartAction provides cloud-based AI-powered Virtual Agent solutions for contact centers. SmartAction's solutions make it easy for enterprises to automate the repetitive conversations handled by live agents, with seamless integrations to existing contact center technology and data sources. SmartAction delivers its conversational AI solution as a service through a team of CX experts who guides brands through the transformation to automation. SmartAction provides omnichannel AI-powered Virtual Agent solutions for contact centers.
Trint utiliza inteligencia artificial para impulsar tu plataforma de transcripción automatizada basada en la web. Los archivos de audio y video se cargan en el software Trint en línea y luego se transcriben utilizando el reconocimiento de voz automatizado. Trint Editor es la combinación de un editor de texto y un reproductor de audio/video: el texto transcrito se une al archivo de audio o video, lo que facilita la búsqueda, verificación y edición de las transcripciones generadas por la máquina. Trint va más allá de la transcripción para proporcionar la plataforma más innovadora para buscar, editar y aprovechar al máximo tu contenido.
Speech to text dictation application for Windows. Experience the freedom of typing with your voice. Speech to text dictation application for Windows. Experience the freedom of typing with your voice.
Gran reconocimiento de voz y aplicación web de traducción de voz instantánea que hace hincapié en la simplicidad y el habla natural mediante la puntuación automática. Características: PUNTUACIÓN AUTOMÁTICA, marca y guarda MARCAS DE TIEMPO, editable, GUARDA AUTOMÁTICAMENTE, transcribe archivos de audio, conversaciones telefónicas y exportaciones a subtítulos. No es necesario registrarse como usuario. Úsalo para dictados, transcripciones, entrevistas, problemas de audición, intérpretes en tiempo real, entre otros. Speechlogger está impulsado por las API de ASR de Google para lograr los mejores resultados. Gran aplicación web de reconocimiento de voz y traducción de voz instantánea gratuita que hace hincapié en la simplicidad y el habla natural mediante la puntuación automática.
Online service and android app for recording and transcribing speech. It edits your audio as you edit the text. Online service and android app for recording and transcribing speech. It edits your audio as you edit the text.
Advanced medical dictation software is built for physicians and practitioners. Works on all EHR platforms and mobile. Build better documentation through speech to text recognition engine designed for medical notes and charts.
Mobile and Cloud-based solution for businesses that helps upload audio files through web, mobile, or cloud & document them to text. Mobile and Cloud-based solution for businesses that helps upload audio files through web, mobile, or cloud & document them to text.
Transcribe converts interviews, podcasts and other audio recordings into text automatically. Transcribe converts interviews, podcasts and other audio recordings into text automatically.
CONVIERTA LAS LLAMADAS EN INGRESOS Empresa de tecnología AI que proporciona soluciones de análisis de voz para centros de llamadas​. Optimice la comunicación con el cliente escuchando las llamadas de los clientes automáticamente. Las herramientas de NeoSound convierten las emociones humanas en datos procesables y significativos que permiten a las empresas escuchar la voz real del cliente. Empresa de tecnología AI que proporciona soluciones de análisis de voz para centros de llamadas​.
Castel Detect LIVE is the LIVE alternative for contact center speech analytics. It provides LIVE compliance and post-call analysis, supporting your quality assurance initiatives. This centers focus on agent behaviors positively and negatively impacting customer experience outcomes. Our analytics process occurs during a LIVE call, so you can take real-time action to ensure compliance and best practice adherence. We provide voice-based analytics, event targeting, agent alert, and workflow tools. Castel Detect LIVE analyzes LIVE calls with high accuracy, alerts, reminders, scripting, and call scoring. Ensure real-time compliance.
Web-based application that allows providers universal access to their work, as well as e-signature and report management capabilities. Web-based application that allows providers universal access to their work, as well as e-signature and report management capabilities.
WSR is an enterprise speech recognition solution that offers front-end (client-side) and back-end (server-side) voice-to-text recognition. With WSR, speech recognized text can be accessed immediately by the author or automatically sent to support staff for review and editing (if needed) - enabling your key earners to focus their time on more revenue generating activities and less on administrative tasks. WSRs voice-to-text technology is easy to use, accurate and light on IT resources. An enterprise speech recognition solution that offers front-end (client-side) and back-end (server-side) voice-to-text recognition.
Software de reconocimiento de voz basado en la nube con la capacidad de convertir voz a texto. Software de reconocimiento de voz basado en la nube con la capacidad de convertir voz a texto.
Speech surveillance and metrics analysis software. This includes text transcription with alert generation and disposition mechanism, and metrics analytics. Speech surveillance and metrics analysis software. This includes text transcription with alert generation and disposition mechanism, an
Language channel type and accent agnostic speech-to-text solution. Speaker identification and voice activity detection technologies. Language channel type and accent agnostic speech-to-text solution. Speaker identification and voice activity detection technologies.
Express Dictate software is a voice recording program that works like a dictaphone. It lets you use your PC or Mac to send dictation to your typist by email, Internet or over the computer network. Professional dictation voice recorder. Works like a traditional dictaphone. Send dictation instantly via the Internet. HIPAA compliant secure encryption. Record to wav, mp3 or dct formats. Easy-to-use interface so you can be dictating in just minutes. Record and send dictation directly from your computer with Express Dictate Digital Dictation Software.
Transcription and editing tool that helps you transcribe audio online by combining a media-player and a text editor. Transcription and editing tool that helps you transcribe audio online by combining a media-player and a text editor.
Crescendo Speech is the first engine to support speaker independent speech recognition for large vocabularies. Available for both front and back-end use, the engine requires zero training with out-of-the box accuracy rates reaching over 95%. Comprehensive speech recognition solution for professional, dictation-intensive environments.
Ver perfil
A flexible API that performs hardware and speaker independent speech recognition on audio data from any audio source. A flexible API that performs hardware and speaker independent speech recognition on audio data from any audio source.
(0 reseñas)
Ver perfil
Transcription tool that helps in dictation, transcription and speech recognition through document editing, EMR integration & more. Transcription tool that helps in dictation, transcription and speech recognition through document editing, EMR integration & more.
(0 reseñas)
Ver perfil
Employs both word spotting and phrase spotting technologies to avoid the limitations of discrete word command &control. Employs both word spotting and phrase spotting technologies to avoid the limitations of discrete word command &control.
(0 reseñas)
Ver perfil
VoltDelta OnDemand Solutions provides a hosted infrastructure for enabling virtual contact centers and home agent call distribution and management, inbound and outbound voice recognition applications, and voice of the customer call and agent screen recording. VoltDelta supports more than 2.4 billion calls and 2 billion SMS text messages per year. Hosted automation center to handle all IVR/speech applications with intelligent ACD and CTI abilities.
(0 reseñas)
Ver perfil
Cloud-based speech recognition software that enables users to play games, control applications and create custom speech commands. Cloud-based speech recognition software that enables users to play games, control applications and create custom speech commands.
(0 reseñas)
Ver perfil
Speech processing tool which enables automated indexing of audio data through interactive conversational systems. Speech processing tool which enables automated indexing of audio data through interactive conversational systems.
Ver perfil
Speech recognition tool which provides translation of text into audible voice recordings through automation. Speech recognition tool which provides translation of text into audible voice recordings through automation.
(0 reseñas)
Ver perfil
Software utilizing voice biometrics to create solutions for security, either web based or installed, with custom reporting and more. Software utilizing voice biometrics to create solutions for security, either web based or installed, with custom reporting and more.
(0 reseñas)
Ver perfil
Software to transcribe speech and audio from voice mails into text format, deliverable as either as an e-mail or as an SMS. Software to transcribe speech and audio from voice mails into text format, deliverable as either as an e-mail or as an SMS.
(0 reseñas)
Ver perfil
Rubidium cubre todo el alcance de un sistema de diálogo de voz: entrada, salida e interacción. Continuamente se innova en soluciones de procesamiento de voz líderes en el sector para aplicaciones integradas, tales como TTS, ASR, compresión de voz e identificación de altavoz biométrico. Se brinda ayuda a los OEM/ODM para brindar a los clientes una experiencia de usuario más productiva y sin manos. Las soluciones VUI multilingües de bajo costo y tamaño reducido permiten a los desarrolladores de productos de consumo llevar sus productos al mercado lo más rápido posible. Soluciones de procesamiento de voz para aplicaciones integradas, tales como TTS, ASR, compresión de voz e identificación de altavoz biométrico.
(0 reseñas)
Ver perfil
Voice capture, speech recognition, editing, distribution and e-signature application platform for healthcare documentation. Voice capture, speech recognition, editing, distribution and e-signature application platform for healthcare documentation.
(0 reseñas)
Ver perfil
Speech recognition for your audio and video files. Speech to text, speaker diarization, voice activity detection. API for easy integration of SpokenData speech recognition into various applications. Advanced transcription editor, adaptive speech recognizer adaptation on user data. Speech recognition for your audio and video files. Speech to text, speaker diarization, voice activity detection.
(0 reseñas)
Ver perfil
Parlance uses speech recognition to modernize and improve the first 30 seconds of every caller's journey, for call centers that want to deliver a better customer experience. With voice-driven access, callers can speak naturally and connect quickly to the resources they need inside large organizations. No punching numbers on a dial pad No long phone tree options to listen to No frustrating auto attendants that repeatedly misunderstand caller response We guarantee ROI! Parlance uses speech recognition to modernize and improve the first 30 seconds of every caller's journey. We guarantee ROI!
(0 reseñas)
Ver perfil
A web-enabled, application service provider (ASP) technology platform for traditional and speech recognized medical transcription. SpeechRite for radiology is a front end speech recognition program with excellent quality, and comprehensive workflow that supports all dictation preferences. It is offered at NO COST, NO HARDWARE, NO RISK, and PAY-PER-USE. It integrates with all PACS/RIS using xml file exchange. It has modules for CTRM, BIRADS, Addendums, Priors, Templates, and macros. ASP web-based dictation and transcription workflow solution for hospitals, MTSOs, clinics, physicians, of any size.
(0 reseñas)
Ver perfil
Ameyo Engage es un software de centro de atención telefónica basado en la nube que permite a las empresas tomar el control de sus operaciones mediante la implementación de cambios más rápidos en las iniciativas de interacción con el cliente y la participación de los empleados, lo que da como resultado una mejor experiencia para cliente y un aumento de las ventas y las colecciones y, en última instancia, la adquisición de clientes fieles y empleados contentos. Haz crecer tu negocio ganando la fidelización del cliente con un software de centro de contacto para clientes de clase mundial.
(0 reseñas)
Ver perfil
Dictation, transcription and speech recognition software serving over 3,500 clients across many industries. Dictation, transcription and speech recognition software serving over 3,500 clients across many industries.
Ver perfil
Red Shift specializes in speech technologies and has the ability to voice enable smartphones, tablets and websites. Red Shift specializes in speech technologies and has the ability to voice enable smartphones, tablets and websites.
(0 reseñas)
Ver perfil
Speech to text software solution that converts live and recorded contact center calls into searchable text. Speech to text software solution that converts live and recorded contact center calls into searchable text.
A secure, cloud-based speech recognition platform for clinicians to securely document patient encounters of all types. Meet more patients and focus on providing care by significantly reducing the time spent in documentation. iPhone and Android apps. No profile creation or training needed. There are no upfront costs; only pay a monthly fee. Access to eCareNotes Customer Service Team 24x7 included. eCareNotes Cloud-based Speech Recognition for Clinicians: Simple - Affordable - EMR Ready
(0 reseñas)
Ver perfil
Speech recognition and radiology reporting solution that everyone can afford Verbatim is the industrys newest and technically most advanced speech recognition and radiology reporting solution that does not burn a hole in your pocket. With the accuracy of 99% and built-in intuitive workflows, you can complete your reports fast and easy. Verbatim from Saince is a versatile and powerful front end speech recognition software.
(0 reseñas)
Ver perfil
Voice recognition and text analytics software, incorporating IVRs, Surveys, Audio and CSV import. Voice recognition and text analytics software, incorporating IVRs, Surveys, Audio and CSV import.
(0 reseñas)
Ver perfil
Turn speech into text with voice recognition software that is ver 98% accurate & based on conversational modeling for health care & IT. Turn speech into text with voice recognition software that is ver 98% accurate & based on conversational modeling for health care & IT.
(0 reseñas)
Ver perfil
Yactraqs audio mining solution provides call centers with advanced speech analytics capabilities that allow our customers to make call center recordings searchable and reportable. Our customers can utilize our tool to index 100% of their recorded phone calls to uncover high impact and actionable data on Voice-of-the-Customer insights, agent performance evaluation, customer service analysis, compliance applications, and more. Yactraq is cutting edge in audio mining and speech analytics with machine learning driven insights extracted from any audible media.
(0 reseñas)
Ver perfil
Sube tu audio/video y obtén la transcripción en minutos usando la inteligencia artificial. Edita, anota, comparte y exporta tus transcripciones. Sube tu audio/video y obtén la transcripción en minutos usando la inteligencia artificial. Edita, anota, comparte y exporta tus transcripciones.
(0 reseñas)
Ver perfil
Sesame is a voice biometric identification system. Sesame uses natural speech for real-time caller identification, creating a voice print based on previous calls without the need of any enrollment process. What can Sesame do for you? Combats Call Center fraud, classification, anti-spam, answering machine detection, sentiment analysis and management Voice biometric identification system with automatic identification of clients voice, gender, age and language.
(0 reseñas)
Ver perfil
Submission platform for investors to get quality pitches and for startups - get their pitches considered for sure VC submission manager
Ver perfil
Wynyard VFA is an analyzing tool that helps in identifying the person behind an unclaimed voice or decoding the speech in a readable format from an unclear voice. It is a web application that recognizes the identity of the speaker. The application is beneficial for the law enforcement and Government bodies to prevent crimes. The best way to analyze recorded voices and reveal identity.
GoVivaces Automatic Speech Recognition engine can accurately recognize spoken words and convert speech into text. It supports several English accents and can be localized to any language. Also, it supports standard telephony as well as web and mobile applications. The GoVivace's ASR engine is suitable for a wide variety of applications such as IVR systems, call transcription, live dictation and closed captioning. An Automatic Speech Recognition engine which understands natural language accurately and converts speech into text.
(0 reseñas)
Ver perfil
SVI (interactive voice server) that offers advanced voice recognition functions for customer reception. SVI (interactive voice server) that offers advanced voice recognition functions for customer reception.
(0 reseñas)
Ver perfil
Solution to instantly capture speech and turn it into a written transcript. Solution to instantly capture speech and turn it into a written transcript.
(0 reseñas)
Ver perfil
Uniphore Software Solutions provides voice and data technologies to transform mobile phone into an enterprise-service delivery Uniphore Software Solutions provides voice and data technologies to transform mobile phone into an enterprise-service delivery
(0 reseñas)
Ver perfil
State of the art cloud voice recognition and dictation workflow solution designed to be flexible and agile. State of the art cloud voice recognition and dictation workflow solution designed to be flexible and agile.
AppTek artificial intelligence and machine learning-based automatic speech recognition and machine translation platform is deployed for the media and entertainment industry as well as call centers. Leveraging over 30 years worth of experience its scientists and research engineers support the research and development of practical systems AppTek enables the highest quality automatic speech recognition and machine translation solutions available anywhere for enterprises everywhere. AppTek offers proprietary artificial intelligence and machine learning-based automatic speech recognition and machine translation.
(0 reseñas)
Ver perfil
Voice Report enables field employees to dictate reports while on the go using a highly secure speech-to-text solution. Voice Report enables field employees to dictate reports while on the go using a highly secure speech-to-text solution.
(0 reseñas)
Ver perfil
AmberScript automatically transforms your audio and video to text - Upload, search, edit and export with ease. AmberScript automatically transforms your audio and video to text - Upload, search, edit and export with ease.
(0 reseñas)
Ver perfil
With its Voice API, TENIOS operates an interface for voice services, which enables the integration of customer-specific voice applications via web technologies into the cloud communications platform. The Voice API bundles a number of functions (in particular dynamic call control) that allow software applications to initiate and receive calls without developers having to deal with telecommunications technologies and protocols. The TENIOS Voice API enables the integration of speech services into your cloud telephony via common web technologies (https, REST).
(0 reseñas)
Ver perfil
Automatically transcribes video and audio to text. Upload, transcribe and edit your transcript online. Export to any format. Automatically transcribes video and audio to text. Upload, transcribe and edit your transcript online. Export to any format.
(0 reseñas)
Ver perfil
Dictation solution that provides powerful speech-to-text engine, extensive vocabularies, and speaker independent recognition. Dictation solution that provides powerful speech-to-text engine, extensive vocabularies, and speaker independent recognition.
(0 reseñas)
Ver perfil
Transcription software for automated audio and video transcription, delivered to your inbox in minutes. Transcription software for automated audio and video transcription, delivered to your inbox in minutes.
(0 reseñas)
Ver perfil
AISB Engine powered by ArmorVox is a language independent voice biometric engine designed for integration into third party applications, solutions and services which using patented speaker adaptive machine learning algorithms. Applications include contact centers and IVR, websites, chat, messaging, digital apps, social media and wearable technologies. Crossmatch 25M Voiceprints per hour verifying within Milliseconds. Average Company saves 15M with Voice Biometrics over 3 years. Current leading authentication and biometric identification solutions cannot prevent hacking and identity theft!
Ver perfil
Designed to understand human spoken language expressed in a natural way by converting speech-to-text in real-time, using DNN models. Designed to understand human spoken language expressed in a natural way by converting speech-to-text in real-time, using DNN models.
(0 reseñas)
Ver perfil
Harnessing the power of A.I. Happy Scribe automatically transcribes audio to text in over 119 languages. Harnessing the power of A.I. Happy Scribe automatically transcribes audio to text in over 119 languages.
(0 reseñas)
Ver perfil
On-premise communications tool which assists contractors with voice transcription, scheduling, documentation, and task planning. On-premise communications tool which assists contractors with voice transcription, scheduling, documentation, and task planning.
(0 reseñas)
Ver perfil
Omnichannel contact center solution with a predictive dialer, speech analytics, and more.. Omnichannel contact center solution with a predictive dialer, speech analytics, and more..
(0 reseñas)
Ver perfil
Converts audio to text in minutes. Converts audio to text in minutes.
(0 reseñas)
Ver perfil
Allows users to automatically transcribe, caption, subtitle, and voiceover their video and audio files in just minutes. Allows users to automatically transcribe, caption, subtitle, and voiceover their video and audio files in just minutes.
Provides realtime feedback on your pronunciation for English and Dutch children and adults. Provides realtime feedback on your pronunciation for English and Dutch children and adults.
(0 reseñas)
Ver perfil
A programmable platform for developers to easily embed real-time contextual language understanding with the flexibility and control to build unique product experiences. APIs for natural conversation understanding.
(0 reseñas)
Ver perfil
Advanced Digital Dictation is an all-inclusive dictation solution, designed to meet the needs of UK legal and professional firms. This Cloud platform includes dictation, transcription, mobility, administration and management tools, reporting and ongoing updates. Advanced provides a fully managed implementation and training process, plus ongoing helpdesk support. Additional modules available include speech recognition and an outsourced transcription service. Includes dictation, transcription, mobility, administration tools, reporting, training, product updates and ongoing helpdesk support.
(0 reseñas)
Ver perfil
Voice recognition software that models and transcribes at scale. Voice recognition software that models and transcribes at scale.

Ava

(0 reseñas)
Ver perfil
Speech recognition software. Speech recognition software.
(0 reseñas)
Ver perfil
Transcribear is browser-based software that can transcribe audio or video recordings automatically and give you an editable transcript with a few clicks in minutes. Repeated experiments indicate that our speech to text technology can reach more than 95% accuracy with good quality recordings. So far we have offered automatic transcription and annotation services for numerous projects in the areas of publishing or research. Start your free trial today or contact us about your project! Browser-based software that can transcribe audio or video recordings automatically and give you an editable transcript in minutes.
The industry leading speech recognition software used by doctors, lawyers, and other professionals to convert speech into text. Starting at $119.99 for the Premium Edition, Dragon has been used by thousands of professionals for dictation and transcription for over 30 years. Runs on both Windows and Mac platforms. Turn speech into text by dictating into Windows-based applications at speeds up to 160 words per minute.
(0 reseñas)
Ver perfil
Software for speech to text conversion and audio transcription. Software for speech to text conversion and audio transcription.
(0 reseñas)
Ver perfil
Platform for audio to text transcription for freelancers and virtual assistants. Platform for audio to text transcription for freelancers and virtual assistants.

Guía de Compra de Software de reconocimiento de voz

¿Qué es el software de reconocimiento de voz?

El software de reconocimiento de voz , también conocido como software de reconocimiento del habla, permite a los ordenadores interpretar la voz humana y transcribir su voz a texto y viceversa. Además, el software de reconocimiento de voz puede mejorar los asistentes virtuales personales realizando acciones específicas activadas por comandos de voz. Las aplicaciones de software de reconocimiento de voz incluyen sistemas de respuesta de voz interactivos (IVR), que dirigen las llamadas entrantes al destinatario correcto según las instrucciones de voz del cliente.

Ventajas del software de reconocimiento de voz

  • Acelerar la documentación: según un estudio de Stanford, tomar notas al dictado es tres veces más rápido que escribirlas. Las soluciones de reconocimiento de voz liberan al usuario para que este pueda centrarse en las tareas importantes en lugar de tomar notas. Los médicos, por ejemplo, pueden documentar las visitas/citas de los pacientes sin tener que registrar manualmente cada nota. Los agentes de atención al cliente pueden documentar las llamadas sin escribir, lo que acelera el proceso de ayuda al cliente y mejora la calidad general del servicio.
  • Tomar notas con eficacia: durante mucho tiempo se ha tendido a pensar (equivocadamente) que las soluciones de reconocimiento de voz son propensas a cometer errores. Sin embargo, a medida que los sistemas de reconocimiento de voz se han ido acercando a niveles de precisión casi humanos, esta preocupación ha ido en decadencia y ahora es ya prácticamente inexistente. De hecho, en la actualidad los usuarios ven estas soluciones como una forma de mejorar la precisión en sus procesos de toma de notas y documentación.

Funciones comunes del software de reconocimiento de voz

  • Registrar audio: grabar sonido o importar/cargar archivos de audio en el sistema.
  • Transcribir de forma automática: transcribir mensajes de voz y archivos de audio.
  • Multilenguaje: reconocer y admitir múltiples idiomas/dialectos.
  • Análisis de voz a texto: analizar, corregir y monitorizar el habla de transcripciones o grabaciones.
  • Editar texto: revisar el texto transcrito y realizar correcciones básicas (por ejemplo, de faltas de ortografía).

Consideraciones a la hora de comprar software de reconocimiento de voz

  • Aplicación móvil: la proliferación de los smartphones ha convertido estos dispositivos móviles en activos imprescindibles para las empresas. Al igual que en otros mercados, las aplicaciones móviles se han abierto paso en el espacio del software de reconocimiento de voz con aplicaciones que te permiten tomar notas sobre la marcha. También puedes conectar tu dispositivo móvil a auriculares bluetooth y auriculares con micrófono para facilitar el dictado. Si tu empresa cuenta con personal móvil, selecciona aquellos productos que ofrezcan aplicaciones móviles.
  • Necesidades específicas del sector: para maximizar las capacidades de la solución de reconocimiento de voz, deberás usar un sistema cuyas funciones se adapten a las necesidades de tu sector. Ciertos productos de reconocimiento de voz se adecuan más que otros a sectores específicos. Los médicos, por ejemplo, necesitan soluciones de reconocimiento de voz compatibles con la terminología médica. Como comprador, debes evaluar aquellos productos que se adapten a las necesidades concretas de tu sector (no olvides leer las reseñas de los usuarios) y seleccionar en consecuencia.
  • Coste total de propiedad (TCO): tal como se indica en la sección de precios, las soluciones de reconocimiento de voz se encuentran disponibles en una amplia variedad de modelos de precio. Ya que el amplio abanico de opciones puede dificultar una comparación de precios directa, estima las necesidades de tu empresa calculando el número de palabras, la duración del audio y el número de los usuarios para determinar el TCO. Una vez calculado, usa el TCO estimado para seleccionar productos que se ajusten a tu presupuesto real.

Tendencias relevantes en software de reconocimiento de voz

  • El reconocimiento de voz se integrará en los dispositivos inteligentes: el IoT (Internet de las cosas, por sus siglas en inglés) es un área muy prometedora para el software de reconocimiento de voz. El software de reconocimiento de voz integrado en las aplicaciones móviles del IoT permite a los usuarios controlar sus dispositivos inteligentes mediante comandos de voz. Las soluciones de reconocimiento de voz son cada vez más precisas y las empresas siguen adoptando el IoT, por lo que se espera que la integración entre estas dos tecnologías aumente durante los próximos cinco años.
  • Los bots basados en voz son el futuro: la tecnología de reconocimiento de voz también tiene un futuro muy prometedor en el ámbito de los chatbots. Cuando se integran con tecnología de reconocimiento de voz, los chatbots pueden emular las conversaciones humanas en la comunicación con los clientes y son capaces de escuchar sus consultas, interpretarlas y realizar recomendaciones. Las empresas también han comenzado a emplear chatbots, por lo que se espera una adopción similar de los bots basados en voz en los próximos cinco a siete años.