Daniel Wolf
4346552312
Improved speed of voice activity detection
...
... by factor 2 by removing second pass.
Also added voice activity detection to progress calculation.
2016-06-15 20:14:51 +02:00
Daniel Wolf
c4b054176c
Fixed WAVE file reader position calculation
...
The bug only showed through massive seek times.
2016-06-15 20:14:44 +02:00
Daniel Wolf
522f6c2019
Made audio stream handling safe for long streams
2016-06-15 20:14:43 +02:00
Daniel Wolf
d1bbe8538e
Added more logging
2016-06-15 20:14:43 +02:00
Daniel Wolf
542a5ee3d8
Added join function for strings
2016-06-15 20:07:51 +02:00
Daniel Wolf
1e29151974
Fixed string conversion for Timed<void>
2016-06-14 17:36:54 +02:00
Daniel Wolf
5cc13cb16f
Improved error message
2016-06-14 17:36:18 +02:00
Daniel Wolf
0d488e8de2
Restored dialog option, this time based on language model
...
This approach should be more robust and error-tolerant.
2016-06-10 22:35:27 +02:00
Daniel Wolf
4ed5908627
Implemented US-English G2P using sound change rules
2016-06-03 20:02:34 +02:00
Daniel Wolf
7a763e8755
Fixed syntax error in sound change data
2016-06-03 20:00:46 +02:00
Daniel Wolf
bf19d267ee
Added sound change code and data
2016-06-03 10:37:47 +02:00
Daniel Wolf
8be6485685
Implemented string conversion from Latin-1 to Unicode
2016-06-02 22:21:37 +02:00
Daniel Wolf
4d45bf7c89
Merged ascii.cpp into stringTools.cpp
2016-06-02 20:09:37 +02:00
Daniel Wolf
4d95b4c2c5
Implemented text tokenization using Flite
2016-06-02 18:24:27 +02:00
Daniel Wolf
8d1c618cec
Patched Flite to prevent name collision with PocketSphinx
2016-06-02 18:24:27 +02:00
Daniel Wolf
942cabd773
Added Flite as library
2016-06-02 18:24:26 +02:00
Daniel Wolf
9f4ebd23e3
Added Flite 1.4 code
...
I'm not using version 2.0 because that version makes it almost impossible
to create a slim build without compiling all the voice synth code (which
we don't need).
2016-06-02 18:24:26 +02:00
Daniel Wolf
d4b9a8e0c6
Implemented simple conversion from Unicode string to ASCII
2016-06-02 18:24:25 +02:00
Daniel Wolf
f1563919e1
Removing redundant prefixes from PocketSphinx log output
2016-05-17 17:56:11 +02:00
Daniel Wolf
c67e916185
Splitting audio into utterances before processing
...
Advantages:
* No problems with long silences (PocketSphinx doesn't like them)
* Potential for parallelization
* Potential for improved phone timing accuracy
2016-05-17 16:01:10 +02:00
Daniel Wolf
bbc933a821
Temporarily removed --dialog option
2016-05-17 14:28:18 +02:00
Daniel Wolf
2f31c5aa61
Refactoring
...
* Rewriting Timeline<T> to be sparse, i.e., allow gaps
* Added specialized subclasses BoundedTimeline<T> and ContinuousTimeline<T>
* Timed<T> and TimeRange: has-a, not is-a
* Introducing Timed<void>
2016-05-17 14:28:18 +02:00
Daniel Wolf
9eef09145e
Added getPairs function
2016-05-12 21:44:46 +02:00
Daniel Wolf
baf2423b27
Added time manipulation functions to TimeRange and Timeline
2016-04-19 22:06:20 +02:00
Daniel Wolf
895b942df3
Implemented AudioStreamSegment
2016-04-19 22:04:43 +02:00
Daniel Wolf
ce204c68de
Fixed constness
2016-04-19 21:12:44 +02:00
Daniel Wolf
c14fb1c7b2
Fixed output format for structured logging
2016-04-19 19:30:38 +02:00
Daniel Wolf
560281807e
Version 0.2.0
2016-04-17 20:22:17 +02:00
Daniel Wolf
8d2d100376
Refactored enum serialization/deserialization
2016-04-17 20:22:16 +02:00
Daniel Wolf
44d18d00f8
Added header file to CMakeLists.txt
...
This makes navigation easier for me. Plus, ReSharper didn't like not knowing the header files.
2016-04-14 22:14:57 +02:00
Daniel Wolf
7ce79f9c08
Replaced Boost.Log with small custom logger
...
Boost.Log is a complex monstrosity and I can't get it to build on OS X.
2016-04-14 09:42:47 +02:00
Daniel Wolf
4941bff739
Replaced strerror_s with (less safe) strerror
...
libc++ (Xcode) doesn't seem to support it.
2016-04-13 10:37:10 +02:00
Daniel Wolf
d8fbd3596b
Fixed UnboundedStream constructor
2016-04-13 10:37:10 +02:00
Daniel Wolf
db6f2e076b
Fixed GCC build
2016-04-12 23:04:16 +02:00
Daniel Wolf
da06576edc
Updated README.md
2016-04-12 22:50:03 +02:00
Daniel Wolf
4b8e38970a
Added hanging indent to help output to make it more readable
2016-04-12 21:23:15 +02:00
Daniel Wolf
fd6b3b1e2f
Supporting multiple export formats
...
- Simplified XML export format
- Added TSV and JSON formats
- Using TSV as standard export format
2016-04-12 21:08:23 +02:00
Daniel Wolf
90e1375f1b
Handling zero-length audio files
2016-04-12 20:45:47 +02:00
Daniel Wolf
7bc4e37a1a
Improved error handling and error messages
2016-04-12 18:02:52 +02:00
Daniel Wolf
04c828506d
Simplified code using Timeline<T>
2016-04-09 22:07:25 +02:00
Daniel Wolf
83291aa96c
Implemented class Timeline<T>
2016-04-09 20:56:25 +02:00
Daniel Wolf
2be3751a4f
Renamed TimeSegment to TimeRange
2016-03-28 20:30:55 +02:00
Daniel Wolf
8c1e24e9c8
Implemented voice activity detection
2016-03-16 21:01:44 +01:00
Daniel Wolf
425f47491c
Fixed compiler warnings
2016-03-16 21:01:43 +01:00
Daniel Wolf
a8900f80ec
Removing DC offset from audio
...
Also a bit of refactoring regarding audio processing
2016-03-16 21:01:43 +01:00
Daniel Wolf
af5a6649c1
Implemented logging to log file
2016-03-08 22:59:44 +01:00
Daniel Wolf
35ec1f8a45
Introduced template functions to unify enum<->string conversions
2016-03-08 22:20:40 +01:00
Daniel Wolf
ad9d8e6567
Renamed `audioInput` directory to `audio`
2016-03-08 18:21:17 +01:00
Daniel Wolf
b78e418a8f
Refactored audio streams
...
* All streams are now mono (simplifies reasoning about samples)
* Streams can be cloned
* Streams can be seeked within
2016-03-07 21:28:31 +01:00
Daniel Wolf
419b0ec469
Making sure log is written in case of exception
2016-03-06 20:40:31 +01:00