[Ifile-discuss] Re: Updated ifile writeup
From: clemens fischer
Subject: [Ifile-discuss] Re: Updated ifile writeup
Date: 18 Feb 2003 10:26:13 +0100
User-agent: Gnus/5.090008 (Oort Gnus v0.08) Emacs/21.3.50 (i386-unknown-freebsd4.6.2)

"Karl Vogel" <address@hidden>:
> * I use an old Emacs package (RMAIL) to read my mail, and I'm
> not sure if it will handle maildir or MH format. I know, VM is
> probably better, but I've already put some effort into messing with
> the RMAIL Lisp stuff.

hello fellow emacs comrade :) gnus digs maildirs now.

> Glad you like it. My spam corpus is too big to put on my ISP homepage:
>
> 18.0M credit
> 0.5M diploma
> 6.0M fraud
> 12.0M gtaylor
> 0.2M license
> 28.0M local
> 204.0M net-abuse
> 3.0M uk-corpus
> ------------------
> 271.7M TOTAL
>
> It gzips down to about 70 Mbytes, which still puts me over my quota.
> I need a better ISP (suggestions welcome). If anyone wants to mirror
> the corpus, let me know; maybe we can work out some type of FTP thing.

a spam-corpus this large definitely deserves special care. did you
think about making it a sourceforge project? there are several sf
projects about bayesian text-classifiers, and some of them have links
to spam corpora, but there's no project collecting spam
systematically. and btw, what is "gtaylor" spam?

clemens