ARPA Language Model File Format

August 12th, 2008

The format of an ARPA language model file, as far as I can tell, is not documented outside of the CMU SLM toolkit source code. I’m regurgitating it here in the hopes that future Google searches on this topic are more fruitful than mine were. :)

/* This is the format introduced and first used by Doug Paul.
   Optionally use a given symbol for the UNK word (id==0).
*/
/*
Format of the .arpabo file:
------------------------------
\data\
ngram 1=4989
ngram 2=835668
ngram 3=12345678

\1-grams:
…
-0.9792 ABC -2.2031
…
log10_uniprob(ZWEIG) ZWEIG log10_alpha(ZWEIG)

\2-grams:
…
-0.8328 ABC DEFG -3.1234
…
log10_bo_biprob(WAS | ZWEIG) ZWEIG WAS log10_bialpha(ZWEIG,WAS)

\3-grams:
…
-0.234 ABCD EFGHI JKL
…

\end\
*/
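For the curious, here is a rough sketch of how a file in this layout could be read. It is my own illustration, not code from the CMU toolkit: the function name parse_arpa is made up, and it assumes the usual backslash-delimited section markers (\data\, \N-grams:, \end\) and whitespace-separated fields, with the trailing back-off weight being optional. The sample text reuses the example lines from the comment above, with the counts shrunk to match.

```python
import re
from collections import defaultdict


def parse_arpa(lines):
    """Parse ARPA-style lines into (declared_counts, ngrams).

    declared_counts maps n-gram order -> count declared in the data header.
    ngrams maps order -> {word_tuple: (log10_prob, log10_backoff_or_None)}.
    Hypothetical helper for illustration only.
    """
    counts = {}
    ngrams = defaultdict(dict)
    order = None      # set once an "N-grams:" section header is seen
    in_data = False   # True while reading the "ngram N=count" header lines
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line == "\\data\\":
            in_data = True
            continue
        if line == "\\end\\":
            break
        section = re.fullmatch(r"\\(\d+)-grams:", line)
        if section:
            order = int(section.group(1))
            in_data = False
            continue
        if in_data:
            declared = re.fullmatch(r"ngram\s+(\d+)\s*=\s*(\d+)", line)
            if declared:
                counts[int(declared.group(1))] = int(declared.group(2))
            continue
        if order is None:
            continue  # ignore anything before the first section header
        fields = line.split()
        log_prob = float(fields[0])
        words = tuple(fields[1:1 + order])
        # The back-off weight is only present when a field follows the words.
        backoff = float(fields[1 + order]) if len(fields) > 1 + order else None
        ngrams[order][words] = (log_prob, backoff)
    return counts, dict(ngrams)


sample = """\\data\\
ngram 1=1
ngram 2=1
ngram 3=1

\\1-grams:
-0.9792 ABC -2.2031

\\2-grams:
-0.8328 ABC DEFG -3.1234

\\3-grams:
-0.234 ABCD EFGHI JKL

\\end\\
"""

counts, ngrams = parse_arpa(sample.splitlines())
print(counts)
print(ngrams[2][("ABC", "DEFG")])
```

Keying each entry by a tuple of words keeps lookups simple if you later want to implement back-off yourself; a real file with millions of n-grams would probably want something more compact.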

“Conservapedia”

June 30th, 2008

There was a really interesting story about a month back about a scientist who has grown a culture of E. coli for over 20 years and discovered a series of mutations leading to an ability to metabolize citrate, something never before observed in this species. Well, this has apparently angered the anti-evolution crowd at “Conservapedia” (which was founded because Wikipedia was supposedly too liberal), and they’ve taken to discrediting the guy. Hilarity ensues. :)

http://arstechnica.com/articles/culture/conservapedias-evolutionary-foibles.ars
http://www.conservapedia.com/Conservapedia:Lenski_dialog

Reminds me of one of my favorite Stephen Colbert quotes: “reality has a well-known liberal bias” =P

SVN/Trac Integration

June 27th, 2008

A long time ago I wrote an article titled CVS/Bugzilla Integration. Here is an updated version for Subversion and Trac.

Note that it doesn’t require access to the Trac Python libs or database; it just uses HTTP. But it does require BeautifulSoup. Install it by adding it to your repository’s post-commit hook script (it takes the normal REPOS and REV params).

Read the rest of this entry »

gPapers in Nature

May 3rd, 2008

Check it out… I was mentioned in a recent Nature article about research paper management tools:

http://www.nature.com/news/2008/080430/full/453012b.html (doi:10.1038/453012b)

Cool, eh? :-)

Google News Photos

April 29th, 2008

The Google News photo algorithm comes up with some hilarious combinations sometimes. :-P

homer_brain.png

(it’s from an old Simpsons episode, if you don’t get it…)

NSF GK-12 Fellowship

April 21st, 2008

Dear Students,

Congratulations. You have been selected for the NSF GK-12 fellowship. This fellowship as you recalled I mentioned pays XX,000 a year, starting with this Summer, plus an extra stipend for tuition.

If you recall, it involves setting up an after-school program for middle-schools to motivate students to choose computer science (or a STEM) for their future careers.

We will come up with the official letter (the one you have to sign) soon (probably next week), but I just wanted to inform you that you have been selected.

If you are still with us on this project, please reply to me acknowledging your acceptance. We had more candidates than positions, so I have to make sure you are all on board before I notify those who were not selected.

Best wishes, and I look forward to hearing from you.


Dr. Jorge A. Cobb

w00t! =P

XOXOXO

April 21st, 2008

I just want to mention that I am madly in love. Yay to me, and how wonderfully amazing my sweet, smart, nerdy, independent, and incredibly gorgeous girlfriend is. I love you babydoll. :)

DeSiGLE - Derek’s Simple Gnome LaTeX Editor

April 10th, 2008

I wanted a simple GTK-based LaTeX editor with spell checking, syntax highlighting and a preview pane. None that I could find fit this bill, so I rolled my own.

Website: http://desigle.org/

Use if you wish.

Daily caffeine ‘protects brain’

April 3rd, 2008

Coffee may cut the risk of dementia by blocking the damage cholesterol can inflict on the body, research suggests.

The drink has already been linked to a lower risk of Alzheimer’s Disease, and a study by a US team for the Journal of Neuroinflammation may explain why.

W00t! =P

http://news.bbc.co.uk/2/hi/health/7326839.stm

Dreams

March 25th, 2008

Sometimes they’re really weird. Night before last, I was dreaming, and woke up. Looked around, realized I had been dreaming, plopped my head back down and fell right back asleep. Although instead of going into another dream, a message popped up (almost as if I was using a computer) saying (via white text on a black background):

If you would like to continue this dream, a charge of $19.95 will be applied to your credit card. Do you want to proceed?

[Yes] [No]

But my brain knew it was dreaming, and the absurdity of it made me laugh hard enough to wake me up again. So +1 for “weird dreams”. :)

P.S. Oddly enough, usually I can’t read in my dreams. The words are almost always confusingly jumbled, not just in word/letter transposition but in physical orientation and alignment. Sometimes if I focus I can pull them together (literally: they move about in what I can only describe as Brownian motion) and make sense of them, but not often. It’s an interesting enough phenomenon that I often try to look for newspapers or books in my dreams when I realize I’m dreaming, and it’s a great litmus test for when I’m not sure.


<Kered.org>   © Copyright 2000-2005 by Derek Anderson