30 January, 2007

Microformats are perfect for SPAMMERS

Microformats logo

I’m very worried about the use of Microformats for contact details and I certainly won’t be using them.

According to Microformats.org, the hCard is a simple, open, distributed format for representing people, companies, organizations, and places.

The purpose of the hCard is to make it easier to download a person or company’s contact details into an application such as Outlook. They’re machine-readable and machine-discoverable.

Why Microformats are good
At a glance, they’re brilliant and the hCard is good because it means no more copying and pasting text from Web pages. Furthermore, Microformats is a decent example of how the Semantic Web can benefit everyone, albeit a very tiny example and a tiny implementation of the Semantic Web.

Why Microformats can be bad
So, if its now very easy for tools (machines) to find your contact details online, won’t this make it even easier for SPAMMERS to find your details and automatically load them into their contact list?

I tend to use the format paul [at] segala [dot] com to make it a little more difficult for SPAMMERS such as easyJet to email me and I still get hammered with loads of rubbish. So, I’m certainly not going to put my email address in a machine-readable format to make it extremely simple for them to grab my details. Furthermore, I’m not going to use a Microformat for my postal address so direct marketing companies can have a field day.

Think about it, if everyone had a hCard for their contact details, SPAMMERS and direct marketing companies would no longer need to employ humans.

Perhaps if we coupled the hCard with Friend of a Friend (FOAF) we could enable trust in some way – i.e. so only trusted parties could download your hCard.

Marc Canter thinks Microformats should be even more open – I agree as long as it’s not the hCard.

Have I got this totally wrong (which is likely), or is everyone else jumping on the ‘I love Microformats and OpenID’ bandwagon in case they’re not perceived as ‘with it’?!

 

13 Responses to “Microformats are perfect for SPAMMERS”

  1. [...] Paul Walsh (of Segala) asks: “Have I got this totally wrong (which is likely), or is everyone else jumping on the ‘I love Microformats and OpenID’ bandwagon in case they’re not perceived as ‘with it’?!” – Two different issues here dude. 1) OpenID allows for federated IDs, for SSO (single sign-on) and hopefully (soon) attribute exchange. That’s a good thing. And easily spam resistent. 2) Microformats on the other hand are a partial solution to ‘what are the meta-data standards’. That of course is a good thing to solve, but without a complete solution – it’s – well only partially solved. So I’m not asking for more OPEN microformats – I’m asking for microformats as feeds and file formats not JUST page tags. Get it? [...]

  2. Mike Butcher 31 January 2007 at 3:08 am #

    I am not an expert on this, but I recall having the same issues with the FOAF format – machine readable contact details.

  3. Paul Walsh 31 January 2007 at 3:14 am #

    Thanks for dropping by Mike. You’re right, you could have the same issue with FOAF at present. The FOAF specification is still under development (sigh!). So, some trust elements could be added. The w3c Content Labels working group is lobbying to have it updated to describe organisations as well as people.

  4. [...] I just noticed that Marc Canter has responded to my original post about Microformats and OpenID. Marc reminds me that they’re two different issues. I actually knew this already so I should have known better and not mentioned OpenID in my post. Although, it did attract Mike’s attention so that’s a good thing. [...]

  5. Kingsley Idehen 31 January 2007 at 6:10 am #

    It’s important to note that FOAF or hCard by definition do not need to expose all data elements (property values). When a service provider generates FOAF or hCard they must provide their customers/members with the ability to designate what stays private and what goes public. The Semantic Web presents immense opportunities to Social Networking Service providers once they are able to understand the implications of my comments above since they are all contact management service providers at the end of the day with SPAM protection as essential part of the value they offer.

    I note that already does this re. FOAF.

    Thus, a Social Network can provide Open Data Access (via FOAF and hCard for instance) and continue to grow. The real kicker is that growth will actually be exponential since the value consumption will feed of machine to machine consumption of basic and value added services instead of the human to machine dominated interaction of today.

  6. Conor O'Neill 31 January 2007 at 6:48 am #

    As I said over at Tom’s blog, if you don’t want your information to be used by the wrong people, don’t put it on the web. hCard makes everyone’s lives easier, both the good guys and the bad guys.

    Spammers overcame silly messing like email obfuscation years ago. I’m happy for all my details to be out there as a hCard and I’ve seen no increase in email/postal/phone spam since I did.

  7. Kingsley Idehen 31 January 2007 at 7:09 am #

    Conor,

    Again, there is no rule that says you put data in hCard or FOAF that you don’t want to be exposed to the public. The key thing is that the hCard or FOAF generator honors this.

    For instance, you can build a valuable data source from a social network without revealing email and other contact details. The richness of the graph the underpins the social network will naturally stimulate the need for more personal connections to be established etc. And it is at this point that the hCard or FOAF provide can kick in as the communications processing manager based on in-built message handling and routing rules.

    Social Networks and contact manaement engines at the end of the day, whether they know this or not.

    Social Networks that play well with the Semantic Web (along the lines I describe) will provide valuable White List Services to a new generation of Rules Bases MTA (Mail Transfer Agent) filters and Sink Drivers (the piece that persist mails on behalf of the MTA into a file system of DBMS engine).

    SPAM will be killed by a combination of RDF Data based White Lists, Rules, and MTA filters and Sink Drivers.

  8. Conor O'Neill 31 January 2007 at 1:57 pm #

    Yeah, I made the point about “how much” on Tom’s blog. I think the only field that is compulsory in a hCard is fn (full name).

  9. Sam Sethi 31 January 2007 at 3:31 pm #

    Paul, we all agree the semantic web is the way forward. One way is microformats and the other way is RDF.

    The link below is from my friend Mark Birbeck who co-wrote part of the XHTML2, Xforms and RDFa specs showing there is a possible third way RDFa.
    http://internet-apps.blogspot.com/2006/10/rdfa-and-microformats.html

    Microformats are working today because they are simple to implement and are providing immediate value already. Tails, Operator and WebCards along with sites Flickr, Yahoo, Upcoming etc

    RDF is the logical systematic way to create a search and query internet but it feels like it is taking for ever to get started and is so hard to grasp for the average user with no immediate return.

    The comparison I will offer is the fight of HTML against the Client/Server C++ world.

    I remember when at Netscape presenting the idea of a browser with simple HTML and being told by corporates it will never take off it was too simple etc. Well we all know what happened there.

    I think we are going to see a convergence of semantic ideas in the next few years thanks to XHTML2 (namespaces), GRRDL and browser support for Xforms etc.

    So my interest in MF’s now is because I can do something with them and they are adding value. I am looking at RDFa and XHTML2 and waiting to see how much support Firefox3 adds.

    The one company that I think will come from left-field to make RDF work is Google. They are big supporters of Atom/GData/XQuery, Atom Publishing Protocol which is in the new Blogger release to replace XML-RPC. Add to this their support for XMPP in GTalk and GCalendar and you can see that Google could start to add in FOAF support firstly in Atom. Google have not supported MF’s which makes me think they will leap frog everyone and go straight to the RDF semantic web and make the web both searchable and easy to query. see Gdata.

    So it is actually a semantic point whether we trust hCards or FOAF.

  10. Kingsley Idehen 31 January 2007 at 3:38 pm #

    Sam,

    Microformats vs. RDF is not a battle that anyone wants to wage. These efforts are so complimentary I cannot put into words.

    Microformats provide a nice foundation for unobtrusively producing RDF Data Model Instance Data without placing the burden on the end-user. In short, this is another effort (like Web 2.0 as a whole) that actually helps to crystallize the entire Semantic Web vision.

    Microformats and RDF are good for each other. Mark Bierbeck certainly knows that hence his effort re. RDF/A :-) .

    I will soon be exposing some live demos to expose the critical point I am trying to make.

  11. Paul Walsh 1 February 2007 at 9:10 am #

    I don’t think it’s a competition between Microformats and RDF. In fact, I don’t think you can really compare them. When I chatted with David Sifry from Technorati in Paris his face almost turned red when I mentioned RDF and FOAF. Personally I think it was the usual misconception about the Semantic Web that prompted his retort. Although he did give me Tantek’s email address when I gave examples such as Content Labels as a means of demonstrating benefits in the very near future.

    I don’t believe the Semantic Web has to be all about RDF either. Further to your question about Firefox (Sam), it’s a huge supporter of RDF already, so introducing new concepts such as Content Labels is seamless.

  12. [...] I just noticed that Marc Canter has responded to my original post about Microformats and OpenID. Marc reminds me that they’re two different issues. I actually knew this already so I should have known better and not mentioned OpenID in my post. Although, it did attract Mike’s attention so that’s a good thing. [...]

  13. Chris Cressman 19 June 2009 at 4:11 pm #

    I’d like to confirm what other commenters have mentioned: hCard has only two required properties, vcard (the container) and fn (formatted name). Authors therefore have granular control over what data they want to be machine readable. I regularly use hCard to mark up my name and URL only.

    I also know there is a blogger (can’t remember who, unfortunately) that uses server-side technology to control who can view his contact info. Everyone gets his name and URL, but only friends who log in with their OpenID get his full details.

Leave a Reply

About

Founded in 2003 and privately owned, Segala is a specialist in testing and certification.

Segala’s mission it to help make the Web more reliable, safe and trustworthy. Our method of certification helps us to realize this mission by exposing more information about the suitability of each website in search results – enabling users to make informed decisions about which sites to visit. Read More…

Contact

Contact us by emailing daphne@segala.com or call +353 (0)1 2931966. Our address is 19 The Mall, Beacon Court, Sandyford, D18. Ireland.

PHVsPjxsaT48c3Ryb25nPndvb19hYm91dDwvc3Ryb25nPiAtIGFib3V0PC9saT48bGk+PHN0cm9uZz53b29fYWRfYmVsb3dfaW1hZ2U8L3N0cm9uZz4gLSAvaW1hZ2VzL2FkNDY4LmpwZzwvbGk+PGxpPjxzdHJvbmc+d29vX2FkX2JlbG93X3VybDwvc3Ryb25nPiAtIGh0dHA6Ly93d3cud29vdGhlbWVzLmNvbTwvbGk+PGxpPjxzdHJvbmc+d29vX2FsdF9zdHlsZXNoZWV0PC9zdHJvbmc+IC0gZ3JlZW50ZWEuY3NzPC9saT48bGk+PHN0cm9uZz53b29fYmxvY2tfaW1hZ2U8L3N0cm9uZz4gLSAvaW1hZ2VzL2FkMzM2LmpwZzwvbGk+PGxpPjxzdHJvbmc+d29vX2Jsb2NrX3VybDwvc3Ryb25nPiAtIGh0dHA6Ly93d3cud29vdGhlbWVzLmNvbTwvbGk+PGxpPjxzdHJvbmc+d29vX2Jsb2c8L3N0cm9uZz4gLSBmYWxzZTwvbGk+PGxpPjxzdHJvbmc+d29vX2Jsb2djYXQ8L3N0cm9uZz4gLSAvYmxvZy88L2xpPjxsaT48c3Ryb25nPndvb19jYXRfbWVudTwvc3Ryb25nPiAtIGZhbHNlPC9saT48bGk+PHN0cm9uZz53b29fY29udGFjdDwvc3Ryb25nPiAtIGNvbnRhY3Qtd2lkZ2V0PC9saT48bGk+PHN0cm9uZz53b29fY3VzdG9tX2Nzczwvc3Ryb25nPiAtICNyc3N7DQpiYWNrZ3JvdW5kOiAjMzNhMTJjOyAvKiM2ZjljMzMgOy8qICNjZTQ5MjA7Ki8NCn0NCg0KLmdyaWRfMTZ7DQpkaXNwbGF5Om5vbmU7DQp9PC9saT48bGk+PHN0cm9uZz53b29fY3VzdG9tX2Zhdmljb248L3N0cm9uZz4gLSA8L2xpPjxsaT48c3Ryb25nPndvb19mZWF0cGFnZXM8L3N0cm9uZz4gLSAxNzU3LDE3NzIsMTc3NjwvbGk+PGxpPjxzdHJvbmc+d29vX2ZlZWRidXJuZXJfdXJsPC9zdHJvbmc+IC0gPC9saT48bGk+PHN0cm9uZz53b29fZ29vZ2xlX2FuYWx5dGljczwvc3Ryb25nPiAtIDwvbGk+PGxpPjxzdHJvbmc+d29vX2xheW91dDwvc3Ryb25nPiAtIGRlZmF1bHQucGhwPC9saT48bGk+PHN0cm9uZz53b29fbG9nbzwvc3Ryb25nPiAtIGh0dHA6Ly9zZWdhbGEuY29tL3dwLWNvbnRlbnQvd29vX3VwbG9hZHMvMy1sb2ctcG5nLWhpZ2gucG5nPC9saT48bGk+PHN0cm9uZz53b29fbWFudWFsPC9zdHJvbmc+IC0gaHR0cDovL3d3dy53b290aGVtZXMuY29tL3N1cHBvcnQvdGhlbWUtZG9jdW1lbnRhdGlvbi92aWJyYW50Y21zLzwvbGk+PGxpPjxzdHJvbmc+d29vX25hdl9leGNsdWRlPC9zdHJvbmc+IC0gMTUxNywxNTE5LDE1MjEsMTUyMywzLDE1OTUsMTYxMywxMCwxNjc2LDE3NTcsMTc3MiwxNzc2PC9saT48bGk+PHN0cm9uZz53b29fc2hvcnRuYW1lPC9zdHJvbmc+IC0gd29vPC9saT48bGk+PHN0cm9uZz53b29fc2hvd19hZDwvc3Ryb25nPiAtIGZhbHNlPC9saT48bGk+PHN0cm9uZz53b29fc2hvd19tcHU8L3N0cm9uZz4gLSBmYWxzZTwvbGk+PGxpPjxzdHJvbmc+d29vX3N0ZXBzPC9zdHJvbmc+IC0gMDEuLCAwMi4sIDAzLjwvbGk+PGxpPjxzdHJvbmc+d29vX3RhYmJlcjwvc3Ryb25nPiAtIGZhbHNlPC9saT48bGk+PHN0cm9uZz53b29fdGhlbWVuYW1lPC9zdHJvbmc+IC0gVmlicmFudENNUzwvbGk+PGxpPjxzdHJvbmc+d29vX3VwbG9hZHM8L3N0cm9uZz4gLSBodHRwOi8vc2VnYWxhLmNvbS93cC1jb250ZW50L3dvb191cGxvYWRzLzMtbG9nLXBuZy1oaWdoLnBuZzwvbGk+PC91bD4=