Commit Graph

52 Commits

Author SHA1 Message Date
Daniel Wolf d29fb6d61a WIP resampling 2023-01-28 22:15:31 +01:00
Daniel Wolf 229ef2a25b Add audio segment 2023-01-28 22:15:30 +01:00
Daniel Wolf 3d3cbdabb1 Add memory audio clip 2023-01-28 22:15:29 +01:00
Daniel Wolf ca575d841e Add support for reading Ogg Vorbis files 2023-01-28 22:15:28 +01:00
Daniel Wolf 9c3b1fb554 Add support for reading WAVE files 2023-01-28 22:15:26 +01:00
Daniel Wolf 2cd2ba77b9 Add AudioClip trait 2023-01-28 22:15:23 +01:00
Daniel Wolf e1e137945e Remove C++ codebase 2022-09-16 18:07:45 +02:00
Daniel Wolf 205540a91b Improve animation rules for "f" sound 2022-06-14 19:00:27 +02:00
Daniel Wolf 9a32641a99 Minor syntax fixes
Fixes #103
2021-10-15 18:14:55 +02:00
Daniel Wolf 77588fb40e Improve WAVE file reader to handle more formats
Fixes #101
2021-10-06 20:48:45 +02:00
Daniel Wolf 3c0befa070 Add test WAVE files 2021-10-04 20:12:28 +02:00
Daniel Wolf 5f293cdd33 Extract WAVE format parsing into its own function 2021-10-04 20:12:00 +02:00
Daniel Wolf 71ec4dca3f Use GitHub Actions for CI
Closes #98
2021-06-25 21:13:38 +02:00
Daniel Wolf 8a94fb7db0 Replace boost::filesystem with std::filesystem
This allows us to use a header-only copy of Boost
2021-06-25 20:58:50 +02:00
Daniel Wolf 74822c4a84 Switch to C++17 2021-06-25 20:58:50 +02:00
Daniel Wolf 2b9d305ea1 Fix warnings 2021-06-25 20:58:50 +02:00
Daniel Wolf a5ebd593d4 Update GoogleTest to latest master 2021-06-23 22:00:12 +02:00
Daniel Wolf 5a6e8346a3 Fix build with Boost 1.70
Fixes #88
2020-09-17 20:26:53 +02:00
Daniel Wolf fef283de43 Implement DAT exporter for Moho 2019-06-27 22:30:16 +02:00
Daniel Wolf 2218d9c69b Improve animation rule for OW sound 2019-05-25 21:17:03 +02:00
Daniel Wolf 18f829d03d Fix build errors in VS2019 2019-05-25 08:03:52 +02:00
Daniel Wolf 6c62318816 Clone timeline to prevent modification during iteration
Fixes #65
2019-02-10 20:18:47 +01:00
Daniel Wolf 8d958d09fb Always export absolute audio file path in XML and JSON format
Fixes #59
2019-01-23 21:43:21 +01:00
Daniel Wolf 06b4855d6d Set helpful configuration settings for the phonetic recognizer
Copied from the PocketSphinx recognizer
2019-01-23 21:43:17 +01:00
Daniel Wolf c821788569 Discard very short segments of voice activity
This prevents short flickers from false VAD positives.
This fixes a regression recently introduced in 2bbad258c0.
2019-01-23 21:43:17 +01:00
Daniel Wolf 21392d32cc Improve read performance for WAVE files
Fixes #58
2019-01-23 21:43:16 +01:00
Daniel Wolf f3d4cfbb31 Fix gaps in phonetic recognition
Randomly, entire utterances yielded no phones with the phonetic recognizer.
The cause was a check for empty utterances that made sense for word
recognition, but not for phonetic recognition.
2019-01-21 22:22:28 +01:00
Daniel Wolf 9ba5fc9653 Fix special characters in output file path
Issue #54
2019-01-04 21:29:03 +01:00
Daniel Wolf 2bbad258c0 Do not use multithreading for VAD
WebRTC adapts to the audio signal. If we slice the audio clip into multiple
shorter clips, then perform VAD on them in parallel, the result may not be as
good.
2019-01-04 21:02:41 +01:00
Daniel Wolf a723942f22 Convert audio to 8kHz before feeding it to WebRTC for VAC
This prevents false positives at the beginning of the audio stream. Fixes issue #52.
2019-01-04 21:02:41 +01:00
Daniel Wolf 1393534624 Prevent utteranceToPhones from reporting incomplete progress 2019-01-03 13:10:05 +01:00
Daniel Wolf 494b1ebebd Log incomplete tasks 2019-01-03 13:10:05 +01:00
Daniel Wolf 367c645bb3 Code cleanup
* Fix linter warnings
* Unify code formatting
* Fix typos
2019-01-02 22:27:45 +01:00
Daniel Wolf cdc9c5b1c4 Upgrade to utf8proc 2.2.0
Fixes issue #38
2019-01-02 15:18:00 +01:00
Daniel Wolf e87ccc8816 Hide progress spinner when ProgressBar gets destructed 2019-01-02 14:38:53 +01:00
Daniel Wolf 3189fc8976 Don't clear away last progress bar 2019-01-02 14:30:03 +01:00
Daniel Wolf e325917abe Preserve progress when continuing console output 2019-01-02 14:30:02 +01:00
Daniel Wolf d0c9a294e9 Correctly print final progress when ProgressBar is destructed
Fixes issue #46
2019-01-02 14:30:02 +01:00
Daniel Wolf 11a29ddef6 Split progress code 2019-01-02 13:22:03 +01:00
Daniel Wolf 610f490046 Implement generic concept of recognizers with options pocketSphinx and phonetic 2019-01-01 22:51:01 +01:00
Daniel Wolf 3ed38ada2f Fix path separator 2018-10-08 20:31:13 +02:00
Daniel Wolf 8648ed5640
Merge pull request #44 from DanielSWolf/feature/#40-ogg-vorbis
Support for Ogg Vorbis file format
2018-09-01 10:34:54 +02:00
Daniel Wolf c22550f7f8 Support Unicode paths when opening Ogg Vorbis files 2018-09-01 10:24:05 +02:00
Daniel Wolf 5e7e6f5f87 Increase Ogg Vorbis read performance through buffering
Reduces runtime by factor 10
2018-09-01 10:24:05 +02:00
Daniel Wolf e13c222e28 Add support for Ogg Vorbis file format
#40
2018-09-01 10:24:05 +02:00
Daniel Wolf 1625de64e2 Move file utilities into their own file 2018-09-01 10:24:04 +02:00
Daniel Wolf d077aae74c Add libvorbis 2018-09-01 10:23:58 +02:00
Daniel Wolf b4a2a295d1 Patch libogg for generic Unix compilers 2018-09-01 10:23:53 +02:00
argent0 e02de085bb Fix build error resulting from assigning boost::uuids::random_generator (#41)
Using a static generator to generate uuids

This time explicitly writing the type.
2018-07-19 20:19:38 +02:00
Daniel Wolf a446209c31 Add libogg 2018-07-13 22:59:47 +02:00