grandjanitor.blogspot.com
The Grand Janitor's Blog: November 2007
http://grandjanitor.blogspot.com/2007_11_01_archive.html
The Grand Janitor's Blog. Speech Recognition, Programming and Random Musings of Arthur Chan. Wednesday, November 07, 2007. Slightly related my last post. It relates to an interesting issue of whether we should share the bookshelf in the first place. Why is it an issue? Well, privacy. Suppose someone is malicious and try to figure you out. The best way is to try to gather all information about you and work against you. Do you read Jane Austen? Do you read Stephen King? Do you read Lora Roberts? It seems t...
grandjanitor.blogspot.com
The Grand Janitor's Blog: April 2013
http://grandjanitor.blogspot.com/2013_04_01_archive.html
The Grand Janitor's Blog. Speech Recognition, Programming and Random Musings of Arthur Chan. Saturday, April 20, 2013. The Boston Marathon Explosion : Afterthought. It has been a crazy week. Lives were crazy for Bostonians . and perhaps all Americans. From the explosion to the capture of suspect were only 5 days. I still feel still disoriented from the whole event. Thursday, April 04, 2013. CMUSphinx on Kindle Touch. Nuance Unveils Voice Ad. Why Carl Icahn's Buying a Stake in Nuance. A look on Sphinx3's ...
grandjanitor.blogspot.com
The Grand Janitor's Blog: November 2011
http://grandjanitor.blogspot.com/2011_11_01_archive.html
The Grand Janitor's Blog. Speech Recognition, Programming and Random Musings of Arthur Chan. Monday, November 21, 2011. The Grand Janitor After CMU Sphinx. I have left the development of CMU Sphinx for around 6 years. Geez. Talking about changes. During the time, I went to work for one startup and one defense contractor. Start numerous non-speech related blogs. In my last 6 years, I can only act as a bystander of Sphinx development. I change job again recently and will work with a company which is cl...
grandjanitor.blogspot.com
The Grand Janitor's Blog: May 2013
http://grandjanitor.blogspot.com/2013_05_01_archive.html
The Grand Janitor's Blog. Speech Recognition, Programming and Random Musings of Arthur Chan. Monday, May 06, 2013. Translation of "Looking forward (only 263 weeks left)". As requested by Pranav, a good friend of Sphinx, I translated one of article "Looking forward (only 263 weeks left)" from my Chinese blog "333 weeks" ( original. So here it is, enjoy! April was a long long month. Saturday, May 04, 2013. My Chinese Blogs : Cumulomaniac and 333 Weeks. If you go click them, they are all in Chinese. In ...
grandjanitor.blogspot.com
The Grand Janitor's Blog: May 2012
http://grandjanitor.blogspot.com/2012_05_01_archive.html
The Grand Janitor's Blog. Speech Recognition, Programming and Random Musings of Arthur Chan. Friday, May 18, 2012. What should be our focus in Speech Recognition? If you worked in a business long enough, you start to understand better what type of work are important. As many things in life, sometimes the answer is not trivial. For example, in speech recognition, what are the important ingredients to work on? This makes speech recognition an exciting field similar to chess programming. Indeed the two ...
thegrandjanitor.com
Building a simple SMT | The Grand Janitor Blog V2
http://thegrandjanitor.com/2015/08/05/building-a-simple-smt
The Grand Janitor Blog V2. My Machine Learning Portfolio. Building a simple SMT. August 5, 2015. Which Tutorial to Follow? If you never run an SMT training before, perhaps the more solid way to start is to follow the "Baseline System" link. A better name could be "How to train a baseline system"). At here, there is a rather detail tutorial on how to train a sets of models from WMT13 mini news commentary. Use source of boost, make sure libbz2 was first installed. Then life would be much easier. BLEU = 23&...
thegrandjanitor.com
November | 2013 | The Grand Janitor Blog V2
http://thegrandjanitor.com/2013/11
The Grand Janitor Blog V2. My Machine Learning Portfolio. Monthly Archives: November 2013. Patterns in ASR Coding. November 27, 2013. Many toolkits in ASR appears in the form of unix executables. But the nature of ASR tool is quite a bit different from general unix tools. I will name 3 here:. Another issue is that much coding in numerical algorithm could cause subtle changes of the results, it is tricky to code these changes well. When these changes penetrated to production, it is usually very hard t...
thegrandjanitor.com
How to Compile a Debugged Version of Python | The Grand Janitor Blog V2
http://thegrandjanitor.com/2013/11/25/compiling-a-debugged-version-of-python
The Grand Janitor Blog V2. My Machine Learning Portfolio. How to Compile a Debugged Version of Python. November 25, 2013. For the most time, you shouldn't need to care about the internals of python. It is usually thought as a tool and assumed to be bug-free. Of course, there are moments you should question these assumptions. Sometimes, the interpreter fails itself. It could segfault, it could be too slow. You can go to stare at the source code and hope that you find the issues. Or you can just. Remember,...
thegrandjanitor.com
My Publications | The Grand Janitor Blog V2
http://thegrandjanitor.com/my-publications
The Grand Janitor Blog V2. My Machine Learning Portfolio. Yu Chung" is my Chinese first name. M Siu, H. Gish, A. Chan. W Belfield, S. Lowe, Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery in Computer Speech and Language 28(1): 210-223 (2014). D Huggins-Daines, M. Kumar, A. Chan. M Siu and A. Chan. M Siu, H. Gish, A. Chan. W Belfield, S. Lowe, Unsupervised training of an HMM-based self-organizing unit recognizer with...