
[Ifile-discuss] Re: Updated ifile writeup


From: clemens fischer
Subject: [Ifile-discuss] Re: Updated ifile writeup
Date: 18 Feb 2003 10:26:13 +0100
User-agent: Gnus/5.090008 (Oort Gnus v0.08) Emacs/21.3.50 (i386-unknown-freebsd4.6.2)

"Karl Vogel" <address@hidden>:

>    * I use an old Emacs package (RMAIL) to read my mail, and I'm
>      not sure if it will handle maildir or MH format.  I know, VM is
>      probably better, but I've already put some effort into messing with
>      the RMAIL Lisp stuff.

hello fellow emacs comrade  :)  gnus digs maildirs now.

>    Glad you like it.  My spam corpus is too big to put on my ISP homepage:
>
>       18.0M   credit
>        0.5M   diploma
>        6.0M   fraud
>       12.0M   gtaylor
>        0.2M   license
>       28.0M   local
>      204.0M   net-abuse
>        3.0M   uk-corpus
>      ------------------
>      271.7M   TOTAL
>
>    It gzips down to about 70 Mbytes, which still puts me over my quota.
>    I need a better ISP (suggestions welcome).  If anyone wants to mirror
>    the corpus, let me know; maybe we can work out some type of FTP thing.

a spam corpus this large definitely deserves special care.  have you
thought about making it a sourceforge project?  there are several sf
projects about bayesian text classifiers, and some of them link to
spam corpora, but no project collects spam systematically.  and btw,
what is "gtaylor" spam?

  clemens



