Daniel Wolf
938079a75f
Renamed phoneExtraction to phoneRecognition
2016-09-21 10:32:26 +02:00
Daniel Wolf
12e7a10ab5
Fixed Xcode detection
2016-09-11 11:40:18 +02:00
Daniel Wolf
6d321d2428
Requiring only Boost 1.54, not 1.58
...
Easier to install on older Linux systems
2016-09-11 11:40:16 +02:00
Daniel Wolf
e81c9da999
Requiring only CMake 3.2, not 3.3
...
Easier to install on older Linux systems
2016-09-11 11:40:15 +02:00
Daniel Wolf
b95a3f621c
Fixed Linux build
2016-08-31 22:21:53 +02:00
Daniel Wolf
4403835268
Version 1.0.0
2016-08-11 16:57:05 +02:00
Daniel Wolf
056da65d03
Improved generated package name
2016-08-11 16:57:04 +02:00
Daniel Wolf
16892ae991
Fixed OS X build
2016-08-10 18:24:24 +02:00
Daniel Wolf
ce11e59c04
Converted README file to AsciiDoc
...
This allows for better formatting, more readable tables, easier references etc.
2016-08-09 22:46:11 +02:00
Daniel Wolf
95d46ef0b7
Re-written animation code
...
* Still uses (almost) the same rules, but more powerful underlying concept
* Re-introduced shape H for "L" sounds
* Introduced shape X for idle position
2016-07-31 21:42:37 +02:00
Daniel Wolf
26cae93478
Refactored audio handling
...
Now audio clips can be passed around as const references
and don't carry state any more.
2016-07-27 21:58:37 +02:00
Daniel Wolf
b3b2366468
Re-written library code for parallel execution
...
The new implementation correctly re-throws exceptions on the calling thread
instead of terminating the application.
2016-07-27 21:44:39 +02:00
Daniel Wolf
17b43ad205
Added class Lazy<T>
2016-07-19 21:33:07 +02:00
Daniel Wolf
9bf8355742
Sped up recognition via multithreading
2016-06-26 21:06:21 +02:00
Daniel Wolf
84097756c8
Added ThreadPool class
2016-06-26 14:02:17 +02:00
Daniel Wolf
96b0ad9b1d
Switched to better acoustic model
2016-06-25 22:07:28 +02:00
Daniel Wolf
c6c31a831c
Using WebRTC for voice activity detection (VAD)
...
My simple power-based approach wasn't reliable enough.
2016-06-21 22:20:18 +02:00
Daniel Wolf
944c374415
Migrated to latest CMU Sphinx version
2016-06-19 21:18:40 +02:00
Daniel Wolf
d1bbe8538e
Added more logging
2016-06-15 20:14:43 +02:00
Daniel Wolf
0d488e8de2
Restored dialog option, this time based on language model
...
This approach should be more robust and error-tolerant.
2016-06-10 22:35:27 +02:00
Daniel Wolf
4ed5908627
Implemented US-English G2P using sound change rules
2016-06-03 20:02:34 +02:00
Daniel Wolf
4d45bf7c89
Merged ascii.cpp into stringTools.cpp
2016-06-02 20:09:37 +02:00
Daniel Wolf
4d95b4c2c5
Implemented text tokenization using Flite
2016-06-02 18:24:27 +02:00
Daniel Wolf
942cabd773
Added Flite as library
2016-06-02 18:24:26 +02:00
Daniel Wolf
d4b9a8e0c6
Implemented simple conversion from Unicode string to ASCII
2016-06-02 18:24:25 +02:00
Daniel Wolf
2f31c5aa61
Refactoring
...
* Rewriting Timeline<T> to be sparse, i.e., allow gaps
* Added specialized subclasses BoundedTimeline<T> and ContinuousTimeline<T>
* Timed<T> and TimeRange: has-a, not is-a
* Introducing Timed<void>
2016-05-17 14:28:18 +02:00
Daniel Wolf
9eef09145e
Added getPairs function
2016-05-12 21:44:46 +02:00
Daniel Wolf
895b942df3
Implemented AudioStreamSegment
2016-04-19 22:04:43 +02:00
Daniel Wolf
560281807e
Version 0.2.0
2016-04-17 20:22:17 +02:00
Daniel Wolf
8d2d100376
Refactored enum serialization/deserialization
2016-04-17 20:22:16 +02:00
Daniel Wolf
44d18d00f8
Added header file to CMakeLists.txt
...
This makes navigation easier for me. Plus, ReSharper didn't like not knowing the header files.
2016-04-14 22:14:57 +02:00
Daniel Wolf
7ce79f9c08
Replaced Boost.Log with small custom logger
...
Boost.Log is a complex monstrosity and I can't get it to build on OS X.
2016-04-14 09:42:47 +02:00
Daniel Wolf
fd6b3b1e2f
Supporting multiple export formats
...
- Simplified XML export format
- Added TSV and JSON formats
- Using TSV as standard export format
2016-04-12 21:08:23 +02:00
Daniel Wolf
83291aa96c
Implemented class Timeline<T>
2016-04-09 20:56:25 +02:00
Daniel Wolf
2be3751a4f
Renamed TimeSegment to TimeRange
2016-03-28 20:30:55 +02:00
Daniel Wolf
8c1e24e9c8
Implemented voice activity detection
2016-03-16 21:01:44 +01:00
Daniel Wolf
425f47491c
Fixed compiler warnings
2016-03-16 21:01:43 +01:00
Daniel Wolf
a8900f80ec
Removing DC offset from audio
...
Also a bit of refactoring regarding audio processing
2016-03-16 21:01:43 +01:00
Daniel Wolf
35ec1f8a45
Introduced template functions to unify enum<->string conversions
2016-03-08 22:20:40 +01:00
Daniel Wolf
ad9d8e6567
Renamed `audioInput` directory to `audio`
2016-03-08 18:21:17 +01:00
Daniel Wolf
b78e418a8f
Refactored audio streams
...
* All streams are now mono (simplifies reasoning about samples)
* Streams can be cloned
* Streams can be seeked within
2016-03-07 21:28:31 +01:00
Daniel Wolf
7efea6f56b
Prepared for logging using Boost.Log v2
2016-02-29 21:48:27 +01:00
Daniel Wolf
4662113254
Added Guidelines Support Library (GSL)
2016-02-29 20:57:14 +01:00
Daniel Wolf
ee9e23a6d2
Version 0.1.0
2016-02-02 21:14:30 +01:00
Daniel Wolf
8e7fcc4efe
Implemented two-step phone detection for better accuracy
2016-01-28 14:19:32 +01:00
Daniel Wolf
21a24c17b6
Suffixing OS X package file with '-OSX' instead of '-Darwin'
2016-01-13 14:43:01 +01:00
Daniel Wolf
cfd9ac9da2
Fixed Xcode build
2016-01-13 12:01:22 +01:00
Daniel Wolf
e55cbd3bcf
Saving my changes to Sphinx as patch files
2016-01-12 22:16:24 +01:00
Daniel Wolf
eace5dbd1b
PACKAGE creates ZIP file
2016-01-12 22:07:26 +01:00
Daniel Wolf
2bfe671f82
Simplified directory structure to make Visual Studio build work
2016-01-08 16:59:18 +01:00