Fighting the war on spam

    Fighting the war on spam


    Posted: 12 Jan 2010
    Every e-mail system has spam, and Hotmail is no exception. However, our SmartScreen™ spam-fighting system is one of the most effective in the industry. This post will give you the inside scoop on how we use it to fight the menace that is spam, and what you can do to help.

    Spam is war

    Why do people send spam in the first place? It’s simple: for the money. Spam is big, big business. Much of it is also illegal, but, unfortunately, that hasn’t kept people from sending it.

    Believe me, spam is a battle. Spammers won’t ever quit, and they are very clever. Spammers are constantly finding new ways to exploit the services that we provide to our customers, to abuse the very system that you use every day to communicate with your friends and family and to conduct your personal business. But we won’t quit, either.

    Various studies, including the monthly spam reports from Symantec, indicate that upwards of 90% of all the e-mail sent over the Internet is spam. As a result, the e-mail sent to Hotmail, just like e-mail sent to other e-mail providers, is mostly spam. With 350 million active users, Hotmail is a big target, and as a result we receive several billion e-mail messages every day.

    But Hotmail removes 98% of all spam before it can even reach your Inbox. Let’s talk about how we do that, and how we’re striving to get even better.

    Know the enemy

    First, let’s get some definitions out of way. Spam is the term used widely in the industry for unsolicited commercial e-mail that is sent in bulk and indiscriminately—that is, the sender of the e-mail does not have any legitimate reason to be sending it to the recipients. No one wants spam in their Inbox.

    But not all unwanted e-mail is spam. For instance, you may be receiving newsletters or product offers as a result of having signed up on a legitimate website. You may not want them now, but they aren’t being sent to you indiscriminately—after all, you signed up for them! We call this kind of e-mail graymail because you may or may not want it in your inbox—figuring out how to handle graymail is not so black and white (hence the name).

    Our goal is to eliminate as much spam as possible. But we need to avoid classifying good e-mail as spam. We call good messages that are mistakenly identified as spam false positives.

    So, the real trick is to eliminate as much spam as possible (ideally all of it) while keeping false positives to a minimum (ideally zero). In some sense, these are conflicting goals, which makes it a tough balancing act.

    It’s all about the numbers

    Common engineering wisdom tells us that we can’t improve what we can’t measure.

    At Hotmail we track several metrics very closely. We track SITI (“spam in the inbox”) every day, and we track true spam, which is the percentage of SITI that does not include graymail. We also track how often we mistakenly put legitimate e-mail in your Junk folder.

    In addition to using automated systems to measure how we are doing, we of course use feedback from Hotmail customers. If you see a message in your inbox that you think is spam, you can let us know by marking it as Junk. If you see a message in your Junk folder that is not spam, you can let us know by marking it as Not Junk, or just by moving it out of the Junk folder.

    Typically, about 75% of e-mail reported as spam by people using Hotmail is actually graymail—that is, legitimate e-mail, but something that a customer didn’t want and marked as spam. A good example of graymail is a newsletter or notification that you signed up for by making a purchase on a website, but didn’t really want.

    So, how are we doing? Well, back in 2006 our spam problem was pretty bad. True spam was approaching 35%. That means that about one out of every three messages in your inbox was spam. Since then we’ve made tremendous progress—getting true spam under 5% and keeping it there. The graphs below show the spam trends on the Internet in general and in Hotmail over the last several years. The green triangles in the Hotmail graph show the points in time when we released new spam-fighting technology.





    You can see that while the spam problem got worse on the Internet overall, Hotmail’s investments really paid off. We are now seeing historical lows not only on spam, but on our rate of false positives.

    SmartScreen: Our weapon against spam

    We’ve achieved our results through tremendous investments in our SmartScreen technology. Let’s talk about some of the weapons we’ve brought to bear in this fight.

    Connection-time filtering. This is our first line of defense. At any point in time, our spam-fighting system has a view of the world of e-mail senders based on various sources of reputation, as well as recent trends in e-mail content. Sender reputation is largely tied to IP addresses or ranges of addresses. We use this information to set limits on how many messages any given sender can deliver to Hotmail. Setting the limit to zero effectively blocks all e-mail from that sender. For good senders we set the limit such that normal e-mail delivery is easily accommodated while minimizing the potential for abuse should the senders’ own computers get hacked. We use several sources to generate sender reputation:


    • IPs of Bots. We track individual computers that are being used to send spam. Often, these computers are PCs that have been infected with malware and taken over by spammers as part of a botnet.
    • Dynamic IPs. We know that computers with dynamically assigned IP addresses should not be sending e-mail, so we track those and reject e-mail coming from those computers.
    • Known spam entities. We make use of external information, like IP address registrations and Autonomous System Numbers, to identify entire ranges of IP addresses that are known to be used for spam and other malicious purposes.
    • Third-party sources. We contract with several third parties to leverage the very best of what the industry has to offer.


    Content filters. We have numerous filters that we run against incoming e-mail that can identify the e-mail as spam by analyzing the content. This isn’t as simple as just searching for phrases like “watch replica.” Our SmartScreen technology uses machine learning to adapt to the trends and techniques used by spammers. The filtering system applies suitably tailored combinations of policy, content filtering, and reputation filtering to different classes of senders. We complement our own filters with third-party filters to increase the overall effectiveness. The filters identify spam with various levels of confidence. When we are absolutely certain that a message is spam, we delete it; otherwise, we put it in your Junk folder. Our content filters take out about one billion spam messages every day.

    Your preferences. You control spam, too! You can set up your own blocklist and safelist and rules, and we use all of that information to do additional filtering of spam.

    Time-traveling filters. You read that right. We can time travel. Well, at least our filters can. It’s actually pretty simple. We can’t always learn about a new spammer the moment they start sending spam. But once we do identify a spammer we can go back and clean out spam that they’ve already sent to your Inbox before you ever see it. We call those tools time-traveling filters, because in some sense we are able to go back in time and get rid of the spam even after we delivered it! (Of course, if you’ve already seen the spam, we can’t remove it. That, as we all know, would create a time-traveling paradox, which might destroy the universe.)

    Malware detection. We scan attachments and block those that contain known malware and viruses.

    Tools in the Hotmail UI. Finally, we put powerful spam tools right in the user interface. We display a safety bar when you’re reading messages that warns you of potential danger based on the sender’s reputation. Links and images are turned off by default for unknown or untrustworthy senders in order to protect you from evil links and web beacons. You can help us identify spam by marking bad messages as Junk or moving messages to your Junk folder. And you can help us minimize false positives by moving legitimate messages back out of the Junk folder. Every time you mark a message as Junk (or Not Junk, as the case may be), our system gets smarter.

    What you can do to help

    Our technology is only part of the solution. We rely on feedback from our users to help in the war on spam. Here are a few ways you can help make our system smarter as well as contribute to the general health of the e-mail ecosystem.


    • Give feedback on the user experience. There are three ways for you to submit feedback to us on individual messages. You can report them as “Junk,” “Phishing Scam,” or “Not Junk” to make our filters smarter. Marking some as “Not Junk” is just as important as the others because it helps us identify a false positive and gives us an opportunity to not repeat the mistake!
    • Participate in the feedback loop. From time to time, we solicit a cross-section of customers to participate in our Feedback Loop program. The program works like this: Every once in a while, we resend an e-mail message you received and we ask you if it’s junk or not. When you classify the mail, it helps fine-tune our spam filters. So, volunteer if you’re asked to participate.
    • Don’t buy from spammers. Very few people actually click on the links in spammer e-mail. But spammers make money because of the sheer volume of spam—remember, these folks are indiscriminate! A typical spammer can be extremely profitable even if only 50 people respond when they send out a million messages—a conversion rate of just 0.005%.
    • Check your computer for malware. Make sure your own computer isn’t a spambot! You can help keep your computer safe by running security software like Microsoft’s free Security Essentials.
    Looking ahead

    Over the last several years the Hotmail team has made critical investments in SmartScreen technologies to not only contain the spam problem, but to get ahead and stay ahead of the industry. And we’re not done—we continue to battle spam and other kinds of abuse with each release of the service.

    In the next post, I’ll talk a bit more about the graymail problem, go a bit deeper on spam filtering, and offer some more tips for getting rid of spam for those of you who are still experiencing a spam problem.

    Until then, I hope you’ll continue to use Hotmail and keep the comments and feedback coming.

    Dick Craddock
    Group Program Manager, Windows Live Hotmail


    More...
    z3r010's Avatar Posted By: z3r010
    12 Jan 2010



  1. Posts : 8,375
    W7 Ultimate x64/W10 Pro x64/W11 Pro Triple Boot - Main PC W7 Remote PC Micro ATX W7 Pro x64/W11 Pro
       #1

    Live is certainly nothing like the old Hotmail or the old ISP used here for sure!

    Try waiting a few weeks and logging back into the old ISP's mail server to find.... some 400+ spam emails have bloated the inbox! GGrrrr....!!!

    One option you now see in the Live Hotmail there is the option to mark any new sender "safe" or "unsafe". That will also block contents attached to any unexpected mail coming in.
      My Computers


  2. Posts : 355
    Windows 7 Professional X64
       #2

    with my Private Domain emails, Cpanel's spam blocker works really well never had a single piece of spam with it, my Hotmail spam folder gets about a dozen spam emails a month
      My Computer


  3. Posts : 2,292
    Windows 7 Home Premium x64
       #3

    Fighting the war on spam-lolmail.jpg

    I just love seeing stuff like this on Gmail

    I have an email address from my ISP, and rarely face spam at all, so I'm satisfied.
    Thanks for the post, nice read
      My Computer


  4. Posts : 8,375
    W7 Ultimate x64/W10 Pro x64/W11 Pro Triple Boot - Main PC W7 Remote PC Micro ATX W7 Pro x64/W11 Pro
       #4

    I have a freebie with the new ISP here as well that still sees a few every so often while keeping that inactive serves as one means of not attracting spam! Awww yes the more active an account is the more it will be hit!
      My Computers


  5. Posts : 334
    Win7 64bit Ultimate
       #5

    Gmail > Live Mail
      My Computer


  6. Posts : 8,375
    W7 Ultimate x64/W10 Pro x64/W11 Pro Triple Boot - Main PC W7 Remote PC Micro ATX W7 Pro x64/W11 Pro
       #6

    For once I'll take Live Hotmail thank you!

    Other then that however and after having been through a large number of free email providers I look for those that see 0% spam and even block links in MS newsletters that come! with a possible phising attack warning!

    That's when you know you have found one worth bothering with!
      My Computers


 

  Related Discussions
Our Sites
Site Links
About Us
Windows 7 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 7" and related materials are trademarks of Microsoft Corp.

© Designer Media Ltd
All times are GMT -5. The time now is 22:44.
Find Us