Using our spam blacklists (2005-07-11) (original) (raw)

Using our spam blacklists

Introduction
Since October 2003 we have been publishing spamvertized domain and 419 email sender addresses. The lists include domain names and email addresses extracted from spam by jwSpamSpy, our spamfilter. We feed data into SURBL which maintains lists that are amongst the most comprehensive and accurate of their kind.

Despite the high volume of additions we have maintained an extremely low error rate, a fact we take pride in. This low error rate is due to a conservative blacklisting policy (see here for more details) and manual inspection. We also handle email inquiries about them. Data published here is for research and non-commercial use.

If you are a commercial users, such as a corporate user or security vendor and are interested in our data: Please license SURBL data through Securityzones, its authorized reseller. You will receive real-time updates, a wider set of data and data in different formats (rbldnsd, bind, CSV, RPZ, etc).

joewein.de LLC is a Limited Liability company based in Tokyo, Japan.

Download URLs of plain text files
Here are plaintext versions of our blacklists. The domain blacklist consists of two files, the 419 blacklist of one file:

MD5 checksums
The following very small files contain hash codes computed from the above files. You can download the following files every hour or even every 15 minutes (make sure your script works properly before you try this rate!) and then run "md5sum -c filename" on each one. If the checksum fails it means the corresponding data file has changed and it's time to download it as well. That way you will never download copies of the actual data files unless they have changed.

https://joewein.net/dl/bl/dom-bl-base.txt.md5
https://joewein.net/dl/bl/dom-bl.txt.md5
https://joewein.net/dl/bl/from-bl.txt.md5

Blacklisting policy
We are aiming primarily at blacklisting domains that have no legitimate uses. There are a number of domains that have questionable privacy policies or no confirmed opt-in (closed loop) subscription process and are often reported as spam that we don't list, because some people do indeed subscribe to their sites.

The current blacklisting procedure has been in place since December 2003. All entries added to the list before that have been purged. Our false positive rate is less than one per month, which means an error rate below 0.01%. None of these have been widely used domains. Here are the main points about our process: