486 Commits

Author SHA1 Message Date
Daniel Wolf 5c428793c9 Version 1.9.0 2019-01-28 21:31:33 +01:00
Daniel Wolf f55bcebf73
Merge pull request #61 from DanielSWolf/bugfix/misc
Multiple small fixes and improvements
2019-01-24 20:37:07 +01:00
Daniel Wolf 8d958d09fb Always export absolute audio file path in XML and JSON format
Fixes #59
2019-01-23 21:43:21 +01:00
Daniel Wolf 06b4855d6d Set helpful configuration settings for the phonetic recognizer
Copied from the PocketSphinx recognizer
2019-01-23 21:43:17 +01:00
Daniel Wolf c821788569 Discard very short segments of voice activity
This prevents short flickers from false VAD positives.
This fixes a regression recently introduced in 2bbad258c0.
2019-01-23 21:43:17 +01:00
Daniel Wolf 21392d32cc Improve read performance for WAVE files
Fixes #58
2019-01-23 21:43:16 +01:00
Daniel Wolf f3d4cfbb31 Fix gaps in phonetic recognition
Randomly, entire utterances yielded no phones with the phonetic recognizer.
The cause was a check for empty utterances that made sense for word
recognition, but not for phonetic recognition.
2019-01-21 22:22:28 +01:00
Daniel Wolf 357cb0b65e
Merge pull request #57 from DanielSWolf/bugfix/#56-script-editor-pops-up
Prevent debug info from popping up
2019-01-10 22:13:35 +01:00
Daniel Wolf c68c12c865 Prevent debug info from popping up
Fixes #56
2019-01-10 20:47:43 +01:00
Daniel Wolf e5066b549f
Merge pull request #55 from DanielSWolf/bugfix/#54-unicode-paths
Fix special characters in output file path
2019-01-04 21:38:47 +01:00
Daniel Wolf 9ba5fc9653 Fix special characters in output file path
Issue #54
2019-01-04 21:29:03 +01:00
Daniel Wolf d52bec8e55
Merge pull request #53 from DanielSWolf/bugfix/#52-unwanted-mouth-movement
Prevent unwanted mouth movement at beginning
2019-01-04 21:03:25 +01:00
Daniel Wolf 2bbad258c0 Do not use multithreading for VAD
WebRTC adapts to the audio signal. If we slice the audio clip into multiple
shorter clips, then perform VAD on them in parallel, the result may not be as
good.
2019-01-04 21:02:41 +01:00
Daniel Wolf a723942f22 Convert audio to 8 kHz before feeding it to WebRTC for VAD
This prevents false positives at the beginning of the audio stream. Fixes issue #52.
2019-01-04 21:02:41 +01:00
Daniel Wolf 10632576bd
Merge pull request #51 from DanielSWolf/bugfix/#46-progress
Prevent incomplete progress reporting
2019-01-03 17:02:27 +01:00
Daniel Wolf 1393534624 Prevent utteranceToPhones from reporting incomplete progress 2019-01-03 13:10:05 +01:00
Daniel Wolf 494b1ebebd Log incomplete tasks 2019-01-03 13:10:05 +01:00
Daniel Wolf 59f2993174
Merge pull request #50 from DanielSWolf/feature/cleanup
Code cleanup
2019-01-03 08:50:10 +01:00
Daniel Wolf 367c645bb3 Code cleanup
* Fix linter warnings
* Unify code formatting
* Fix typos
2019-01-02 22:27:45 +01:00
Daniel Wolf 238687e33b
Merge pull request #49 from DanielSWolf/feature/#38-upgrade-utf8proc
Upgrade to utf8proc 2.2.0
2019-01-02 16:21:00 +01:00
Daniel Wolf cdc9c5b1c4 Upgrade to utf8proc 2.2.0
Fixes issue #38
2019-01-02 15:18:00 +01:00
Daniel Wolf f4418ff25a
Merge pull request #48 from DanielSWolf/bugfix/#46-progress
Fix progress reporting
2019-01-02 15:04:27 +01:00
Daniel Wolf 1d04b01654 Update changelog 2019-01-02 14:52:07 +01:00
Daniel Wolf e87ccc8816 Hide progress spinner when ProgressBar gets destructed 2019-01-02 14:38:53 +01:00
Daniel Wolf 3189fc8976 Don't clear away last progress bar 2019-01-02 14:30:03 +01:00
Daniel Wolf e325917abe Preserve progress when continuing console output 2019-01-02 14:30:02 +01:00
Daniel Wolf d0c9a294e9 Correctly print final progress when ProgressBar is destructed
Fixes issue #46
2019-01-02 14:30:02 +01:00
Daniel Wolf 11a29ddef6 Split progress code 2019-01-02 13:22:03 +01:00
Daniel Wolf 44d6cb83de Update documentation 2019-01-02 12:16:35 +01:00
Daniel Wolf c078e6186e
Merge pull request #47 from DanielSWolf/feature/phonetic-recognition
Phonetic recognition
2019-01-02 11:36:08 +01:00
Daniel Wolf d029458c70 Document phonetic recognizer 2019-01-01 23:16:12 +01:00
Daniel Wolf bfc98a1c81 Add recognizer support to Spine integration 2019-01-01 22:51:02 +01:00
Daniel Wolf 3bf7a00d42 Add recognizer support to After Effects integration 2019-01-01 22:51:02 +01:00
Daniel Wolf 7ebe9b53e8 Internal version 1.9.0-pre.1 2019-01-01 22:51:02 +01:00
Daniel Wolf 610f490046 Implement generic concept of recognizers with options pocketSphinx and phonetic 2019-01-01 22:51:01 +01:00
Daniel Wolf 3ed38ada2f Fix path separator 2018-10-08 20:31:13 +02:00
Daniel Wolf 55a099ddc6 Version 1.8.0 2018-09-01 21:02:44 +02:00
Daniel Wolf bf044ca127 Update After Effects plugin to support non-WAVE audio files 2018-09-01 21:02:43 +02:00
Daniel Wolf c93da48c77 Rename changelog
... and fix typo
2018-09-01 21:02:43 +02:00
Daniel Wolf ce02a3a003 Change format of changelog 2018-09-01 20:31:50 +02:00
Daniel Wolf 49097b00d7 Update documentation 2018-09-01 15:19:14 +02:00
Daniel Wolf 8648ed5640
Merge pull request #44 from DanielSWolf/feature/#40-ogg-vorbis
Support for Ogg Vorbis file format
2018-09-01 10:34:54 +02:00
Daniel Wolf c22550f7f8 Support Unicode paths when opening Ogg Vorbis files 2018-09-01 10:24:05 +02:00
Daniel Wolf 5e7e6f5f87 Increase Ogg Vorbis read performance through buffering
Reduces runtime by a factor of 10
2018-09-01 10:24:05 +02:00
Daniel Wolf e13c222e28 Add support for Ogg Vorbis file format
#40
2018-09-01 10:24:05 +02:00
Daniel Wolf 1625de64e2 Move file utilities into their own file 2018-09-01 10:24:04 +02:00
Daniel Wolf d077aae74c Add libvorbis 2018-09-01 10:23:58 +02:00
Daniel Wolf b4a2a295d1 Patch libogg for generic Unix compilers 2018-09-01 10:23:53 +02:00
argent0 e02de085bb Fix build error resulting from assigning boost::uuids::random_generator (#41)
Using a static generator to generate uuids

This time explicitly writing the type.
2018-07-19 20:19:38 +02:00
Daniel Wolf a446209c31 Add libogg 2018-07-13 22:59:47 +02:00