From karstenj at microsoft.com Tue Aug 3 09:36:30 2010 From: karstenj at microsoft.com (Karsten Januszewski) Date: Tue Aug 3 10:40:34 2010 Subject: [uf-discuss] UfXtract .Net microformats parser open-sourced In-Reply-To: References: <36A319113CF910438942741C4727ADFF04A486AF@MOBY.Clarence.local> <1280351993.23241.1387235699@webmail.messagingengine.com> Message-ID: <2F84DBBA27E3A64E93EF2596E8F9ECAF0BAD69@TK5EX14MBXW653.wingroup.windeploy.ntdev.microsoft.com> Indeed, this is great to see made available to the community. Thanks so much, Glenn! Karsten Januszewski http://visitmix.com > On Mon, 26 Jul 2010 10:53 +0100, "Glenn Jones" > wrote: >> Hi All >> >> I have just open-sourced UfXtract .Net microformats parser. ?With a few >> lines of code you can load and parse microformats from Urls or HTML >> strings. ?You can then extract the data directly in .Net or convert it >> into JSON, JSON-P or XML. >> >> UfXtract currently supports the following microformats hCard, hCalendar, >> hReview, hResume, hAtom, XFN, rel-tag, geo, adr, rel-nofollow, >> rel-license, rel-directory, rel-home, rel-enclosure, rel-payment and >> votelinks. >> >> It also supports a handful of POSH patterns hCard-XFN, rel-me, >> rel-next/previous, test-suite and test-fixture. The support of rel-me >> and rel-next/previous was added to help people build social graph >> spiders. >> >> UfXtract can typically parse a page between 10-50ms. I have gone to some >> pains to build a test suite to make sure it conforms as closely as >> possible to the microformats specs. >> >> You can also easily create new microformats and POSH definitions using >> some simple .Net objects. >> >> API - http://ufxtract.com/ >> Documentation - http://ufxtract.com/documentation/ >> Source code - http://github.com/glennjones/ufxtract/ >> Test suite - http://www.ufxtract.com/testsuite/ >> >> Hopefully people will find it useful... >> >> Glenn Jones >> >> >> _______________________________________________ >> microformats-discuss mailing list >> microformats-discuss@microformats.org >> http://microformats.org/mailman/listinfo/microformats-discuss >> > _______________________________________________ > microformats-discuss mailing list > microformats-discuss@microformats.org > http://microformats.org/mailman/listinfo/microformats-discuss > _______________________________________________ microformats-discuss mailing list microformats-discuss@microformats.org http://microformats.org/mailman/listinfo/microformats-discuss From fberriman at gmail.com Thu Aug 12 01:50:16 2010 From: fberriman at gmail.com (Frances Berriman) Date: Thu Aug 12 01:50:56 2010 Subject: [uf-discuss] Re: Microformats panel at SXSWi 2011 In-Reply-To: References: Message-ID: Our panel is in the picker: http://panelpicker.sxsw.com/ideas/view/5584 (I've already asked them to correct the spelling of their microformats tag :( ) Please do give it a thumbs up. Also, as I alluded to before, I really expect this to be a community panel representing us as a whole, and as such, I am more than welcome to receive your suggestions for things you want to see, don't want to see, people you've been inspired by recently, good site implementations, tricky problems etc etc. I'll do my best to co-ordinate this session as best covers what people want to see, and as always, will include Q&A time (assuming this gets into the final selection, of course, which is why your vote is important). I recommend popping ideas in as comments on the proposal so that non-mailing list readers can benefit. Thanks, Also came across this session which might be of interest to people here: http://panelpicker.sxsw.com/ideas/view/5613 On 21 June 2010 22:01, Frances Berriman wrote: > Hi people, > > SXSWi has horrendously early deadlines for panel proposals every year, > so despite the fact that we're only in June, I'm thinking ahead to > next March and am considering putting forward a microformat themed > panel for 2011. > > I wanted to send out an email to people to ask two things: > > 1. Does anyone have items they'd like to see in a panel discussion. > Topics of particular interest. ?Projects they themselves have worked > on and would consider coming to Texas to have a talk about. etc. > > As Tantek mentioned to me today, there's been loads of recent events - > such as the ever growing selection of google rich snippets appearing > in search terms - and various websites making the best use of these > (hRecipe springs to mind immediately). The topics around the future of > microformats in HTML5 and beyond and also he also mentioned RelMeAuth > (which I haven't yet investigated for myself, but am doing right now). > ?http://microformats.org/wiki/RelMeAuth > > 2. Whether someone has already put forth a panel suggestion. ?If so > ?* Let me know about the topic, so I can suggest something radically different > ?* and/or we'll put the community behind making sure an already > submitted suggestion makes it through if featured in the panel picker > come August. > > No matter who is putting forth a proposal, I'd like to see a presence > from our community in some form next year. > > Happy to have a discussion about this here - or email me privately. > Either way - please let me know your thoughts before the 30th June, so > I have time to put together the proposal before the deadline of July > 9th. > > Thanks so much, > > Frances > > > > -- > Frances Berriman > http://fberriman.com > From timstrees at googlemail.com Mon Aug 16 05:59:59 2010 From: timstrees at googlemail.com (Tim's Trees) Date: Mon Aug 16 06:00:08 2010 Subject: [uf-discuss] Trying to learn microformats Message-ID: I came across the following script, but I can not get it to be recognised as a microformat, by my browsers Chrome and Firefox, with the normal microformat extensions. Other microformats I have tried were recognised. The offending code is
Big Bad Voodoo Daddy Musical Performance featuring Big Bad Voodoo Daddy Bass Performance Hall Fort Worth TX 4th and Calhoun Streets
I can not see an obvious mistake, so any advice would be appreciated. MCL From brian.suda at gmail.com Mon Aug 16 06:46:03 2010 From: brian.suda at gmail.com (Brian Suda) Date: Mon Aug 16 06:46:07 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: Message-ID: On Mon, Aug 16, 2010 at 12:59 PM, Tim's Trees wrote: > I came across the following script, but I can not get it to be > recognised as a microformat, by my browsers Chrome and Firefox, with > the normal microformat extensions. do you have a URL so that others can test it as well? -brian -- brian suda http://suda.co.uk From andreluis.pt at gmail.com Mon Aug 16 07:39:10 2010 From: andreluis.pt at gmail.com (=?ISO-8859-1?Q?Andr=E9_Lu=EDs?=) Date: Mon Aug 16 07:39:16 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: Message-ID: Tim, most user-agent tools ignore hidden elements... So remove the display: none; and try again. :) Cheers, Andr? Lu?s On 16 August 2010 14:46, Brian Suda wrote: > On Mon, Aug 16, 2010 at 12:59 PM, Tim's Trees wrote: >> I came across the following script, but I can not get it to be >> recognised as a microformat, by my browsers Chrome and Firefox, with >> the normal microformat extensions. > > do you have a URL so that others can test it as well? > > -brian > > > -- > brian suda > http://suda.co.uk > _______________________________________________ > microformats-discuss mailing list > microformats-discuss@microformats.org > http://microformats.org/mailman/listinfo/microformats-discuss > From angus at pobox.com Mon Aug 16 07:40:12 2010 From: angus at pobox.com (Angus McIntyre) Date: Mon Aug 16 07:46:58 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: Message-ID: <66a8e08509cd2e92274b84364a0ad741.squirrel@webmail.nomadcode.com> Tim's Trees wrote: > I came across the following script, but I can not get it to be > recognised as a microformat, by my browsers Chrome and Firefox, with > the normal microformat extensions. Other microformats I have tried > were recognised. The offending code is I was able to get Operator to acknowledge the microformat in your sample. One thing that I did notice is that it didn't work until I corrected the 'fancy' quotes around your URL to simple 'straight' quotes. I don't know if the 'fanciness'(i.e. typographic quotes with different characters for close and open) was inserted by your email client or if it was present in your original source-code, but it's something you might like to check. By the way, my understanding is that setting 'display: none' on microformatted data will probably be frowned on. I don't think it's the cause of your problem (although I suppose an extension could assume that 'display: none' indicated that the author didn't want a piece of data to be taken into account) but if my reading of: http://microformats.org/wiki/principles is correct, hiding microformatted data is discouraged. Angus From martin at weborganics.co.uk Mon Aug 16 09:09:35 2010 From: martin at weborganics.co.uk (Martin McEvoy) Date: Mon Aug 16 09:09:56 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: Message-ID: <4C6962BF.8090600@weborganics.co.uk> Hello Tim, On 16/08/2010 13:59, Tim's Trees wrote: > I came across the following script, but I can not get it to be > recognised as a microformat, by my browsers Chrome and Firefox, with > the normal microformat extensions. > > Other microformats I have tried were recognised. > > The offending code is > >
> > Big Bad Voodoo Daddy > > > Musical Performance featuring Big Bad Voodoo > Daddy > > class=?url fn org? only_path=?false?>Bass Performance Hall > > Fort Worth > TX > 4th and Calhoun Streets > > >
It looks like you have some mixed quotes "" in there, if I copy and paste your example into notepad nothing seems to be amiss but if you paste into something else say dreamweaver or notepad2 you start to see the problem :) try validating. You can validate fragments of html at http://validator.w3.org/#validate_by_input+with_options , check validate html fragment, you will see what the problem is. Microform.at has a transformer that transforms microformat fragments by direct input http://microform.at/direct/ which is useful for trying new markup, test before you publish ;) Hope all that helps, good luck. -- Martin McEvoy From timstrees at googlemail.com Mon Aug 16 11:24:35 2010 From: timstrees at googlemail.com (Tim's Trees) Date: Mon Aug 16 11:24:44 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: <0015174c11e0fa356e048df4439e@google.com> References: <4C6962BF.8090600@weborganics.co.uk> <0015174c11e0fa356e048df4439e@google.com> Message-ID: Thank you all for your full and quick replies. You were right, the original was a complete mismatch of quotes and once I had fixed those it displayed in Chrome and Operator:Firefox. I apologise for not spotting that. I have noted the urls you have recommended and will investigate those first in future. I am wanting to create an event site and I had the idea, I should have my entries in hCalendar format, but I am now worrying, that it may be too easy to clone my site, with a right click. Do you have any opinions on this ? Tim On Aug 16, 2010 5:09pm, Martin McEvoy wrote: > ?Hello Tim, > > > > On 16/08/2010 13:59, Tim's Trees wrote: > > > I came across the following script, but I can not get it to be > > recognised as a microformat, by my browsers Chrome and Firefox, with > > the normal microformat extensions. > > > > Other microformats I have tried were recognised. > > > > The offending code is > > > > > > http://www.zvents.com/fort-worth-tx/events/show/79634462?> > > Big Bad Voodoo Daddy > > > > > > Musical Performance featuring Big Bad Voodoo > > Daddy > > > > http://www.zvents.com/fort-worth-tx/venues/show/35132? > > class=?url fn org? only_path=?false?>Bass Performance Hall > > > > Fort Worth > > TX > > 4th and Calhoun Streets > > > > > > > > > > > > It looks like you have some mixed quotes "" in there, if I copy and paste your example into notepad nothing seems to be amiss but if you paste into something else say dreamweaver or notepad2 you start to see the problem :) try validating. > > > > You can validate fragments of html at http://validator.w3.org/#validate_by_input+with_options , check validate html fragment, ?you will see what the problem is. > > > > Microform.at has a transformer that transforms microformat fragments by direct input http://microform.at/direct/ which is useful for trying new markup, test before you publish ;) > > > > Hope all that helps, good luck. > > > > -- > > Martin McEvoy > > > From martin at weborganics.co.uk Mon Aug 16 15:12:07 2010 From: martin at weborganics.co.uk (Martin McEvoy) Date: Mon Aug 16 15:12:24 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: <4C6962BF.8090600@weborganics.co.uk> <0015174c11e0fa356e048df4439e@google.com> Message-ID: <4C69B7B7.4010903@weborganics.co.uk> On 16/08/2010 19:24, Tim's Trees wrote: > Thank you all for your full and quick replies. You are welcome .... > You were right, the > original was a complete mismatch of quotes and once I had fixed those > it displayed in Chrome and Operator:Firefox. I apologise for not > spotting that. :) > I have noted the urls you have recommended and will investigate those > first in future. > > I am wanting to create an event site and I had the idea, I should have > my entries in hCalendar format, but I am now worrying, that it may be > too easy to clone my site, with a right click. Do you have any > opinions on this ? Im a little unsure of what you mean if you mean by cloning perhaps you mean spoofing? ( copying a website possibly for fraud such as phishing or email-spoofing ) it doesn't really happen *too* much in the real world unless your site is a bank or it offers online payments in some way (e.g. PayPal), In which case I wouldn't worry to much about that. Having said all that Social Networking sites (Facebook/MySpace) are becoming targets for these kind of attacks nowadays. If you are worried about people copy and pasting from your website, unless its copyrighted material, again don't worry too much, Id take that as a compliment, the majority of people who *do* copy and paste tend to be just learning. If its for anything else the stuff they are copying will never do them any good as far as search engines are concerned because *you* published the data *first*. Some search engines (google) will actually remove pages that contain duplicate content from their listings, or it will bury the duplicate content so deep in their listings that there is no way anyone will ever see it anyway. The rule of thumb concerning microformats is, If you use microformats on your website you can expect your data to be shared, crawled and Indexed by practically anything that can consume microformats, If you don't want this to happen, say because your data is private or something sensitive, then don't use microformats. Dont let that last part put you off though, sharing your data, particularly events and contact details, *is* a good thing. Hope all that helps rest your mind a little. Best wishes -- Martin McEvoy From timstrees at googlemail.com Tue Aug 17 00:52:31 2010 From: timstrees at googlemail.com (Tim's Trees) Date: Tue Aug 17 00:58:18 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: <4C69B7B7.4010903@weborganics.co.uk> References: <4C6962BF.8090600@weborganics.co.uk> <0015174c11e0fa356e048df4439e@google.com> <4C69B7B7.4010903@weborganics.co.uk> Message-ID: Martin, Thank you for your detailed reply. I had hoped my site would be a comprehensive list of events within a certain category. Let us say Craft Fairs and because of the searching I would provide, a user could select all Craft Fairs in the Cornwall area for the next 12 months. If each event was an hCalendar, then they could right click to add all to a Google Calendar and publish that Google Calendar. If I had hoped to generate some sort of advertising revenue from my original site, then this would potentially dilute that income stream. They could similarly data-scrape the page and reformat and publish, but that would be more difficult. So my fear was/is, am I making it too easy to acquire my lists or should I take it as a compliment and not worry. I was also looking at doing something similar with hListing for a classifieds' site. Many thanks again for your excellent reply. Tim On 16 August 2010 23:12, Martin McEvoy wrote: > ?On 16/08/2010 19:24, Tim's Trees wrote: >> >> Thank you all for your full and quick replies. > > You are welcome .... > >> You were right, the >> original was a complete mismatch of quotes and once I had fixed those >> it displayed in Chrome and Operator:Firefox. I apologise for not >> spotting that. > > :) > >> I have noted the urls you have recommended and will investigate those >> first in future. >> >> I am wanting to create an event site and I had the idea, I should have >> my entries in hCalendar format, but I am now worrying, that it may be >> too easy to clone my site, with a right click. Do you have any >> opinions on this ? > > Im a little unsure of what you mean if you mean by cloning perhaps you mean > spoofing? ( copying a website possibly for fraud such as phishing or > email-spoofing ) it doesn't really happen *too* much in the real world > unless your site is a bank or it offers online payments in some way (e.g. > PayPal), ?In which case I wouldn't worry to much about that. Having said all > that Social Networking sites (Facebook/MySpace) are becoming targets for > these kind of attacks nowadays. > > If you are worried about people copy and pasting from your website, unless > its copyrighted material, again don't worry too much, Id take that as a > compliment, the majority of people who *do* copy and paste tend to be just > learning. If its for anything else the stuff they are copying will never do > them any good as far as search engines are concerned because *you* published > the data *first*. Some search engines (google) will actually remove pages > that contain duplicate content from their listings, ?or it will bury the > duplicate content so deep in their listings that there is no way anyone will > ever see it anyway. > > The rule of thumb concerning microformats is, If you use microformats on > your website you can expect your data to be shared, crawled and Indexed by > practically anything that can consume microformats, If you don't want this > to happen, say because your data is private or something sensitive, then > don't use microformats. Dont let that last part put you off though, sharing > your data, particularly ?events and contact details, *is* a good thing. > > Hope all that helps rest your mind a little. > > Best wishes > > -- > Martin McEvoy > > From send.missive at coreymwamba.co.uk Tue Aug 17 20:05:07 2010 From: send.missive at coreymwamba.co.uk (Corey Mwamba) Date: Tue Aug 17 20:05:19 2010 Subject: [uf-discuss] microformats as HTML5 microdata - example discussion Message-ID: <201008180405.08230.send.missive@coreymwamba.co.uk> Hello, I was reading http://microformats.org/wiki/microdata and did a small bit of thinking around the use of the item/itemprop attributes. I actually put a couple of examples in the wiki itself - but thought it would be polite to post here as well and ask for any feedback or potential issues. Apologies if this has been discussed before. My examples were for an hCard: Corey Mwamba

Corey Mwamba

56 Nowhere Road

Nowhere

NO1 6QT

And an hCalendar: Web 2.0 Conference
Web 2.0 Conference: - , at the Argent Hotel, San Francisco, CA
I was wondering: 1. would the profile for microformats have to be re-written for a parser to be able to pick up the itemprops? 1a. How hard is that to do? 1b. Would it be worth it? 2. Can the use of item attributes scale to the potential complexity of microformats? My initial thoughts were that it could - as stated on the page you could use itemref to include another item - but perhaps someone more technically minded can see an issue? All the best, C. From martin at weborganics.co.uk Wed Aug 18 09:42:28 2010 From: martin at weborganics.co.uk (Martin McEvoy) Date: Wed Aug 18 10:11:48 2010 Subject: [uf-discuss] Trying to learn microformats In-Reply-To: References: <4C6962BF.8090600@weborganics.co.uk> <0015174c11e0fa356e048df4439e@google.com> <4C69B7B7.4010903@weborganics.co.uk> Message-ID: <4C6C0D74.1060801@weborganics.co.uk> Hello Tim, On 17/08/2010 08:52, Tim's Trees wrote: > Martin, > > Thank you for your detailed reply. > > I had hoped my site would be a comprehensive list of events within a > certain category. > > Let us say Craft Fairs and because of the searching I would provide, a > user could select all Craft Fairs in the Cornwall area for the next 12 > months. > If each event was an hCalendar, then they could right click to add all > to a Google Calendar and publish that Google Calendar. If I had hoped > to generate some sort of advertising revenue from my original site, > then this would potentially dilute that income stream. Not necessarily, If your site was just publishing events, then that is no bad thing you can always provide advertising along side the event data, the hope is that you will attract *more* visitors because you have published your data in a "take away" format i.e. hCalendar. > They could similarly data-scrape the page and reformat and publish, > but that would be more difficult. Indeed .... > So my fear was/is, am I making it too easy to acquire my lists or > should I take it as a compliment and not worry. Making your events/lists easy to acquire is a good thing on the whole, by doing so you are not only making it easy for the average person to do something with your data, you are also making it easy for search engines to store your data and include in their listings. Other sites may also want to do something with your data, but again don't worry too much about that, unless they are copying your entire website of course ;) Its the nature off the web these days to syndicate/re-publish data somewhere else. > I was also looking at doing something similar with hListing for a > classifieds' site. Good Idea :) > Many thanks again for your excellent reply. > No problem. Martin McEvoy > Tim > > On 16 August 2010 23:12, Martin McEvoy wrote: >> On 16/08/2010 19:24, Tim's Trees wrote: >>> Thank you all for your full and quick replies. >> You are welcome .... >> >>> You were right, the >>> original was a complete mismatch of quotes and once I had fixed those >>> it displayed in Chrome and Operator:Firefox. I apologise for not >>> spotting that. >> :) >> >>> I have noted the urls you have recommended and will investigate those >>> first in future. >>> >>> I am wanting to create an event site and I had the idea, I should have >>> my entries in hCalendar format, but I am now worrying, that it may be >>> too easy to clone my site, with a right click. Do you have any >>> opinions on this ? >> Im a little unsure of what you mean if you mean by cloning perhaps you mean >> spoofing? ( copying a website possibly for fraud such as phishing or >> email-spoofing ) it doesn't really happen *too* much in the real world >> unless your site is a bank or it offers online payments in some way (e.g. >> PayPal), In which case I wouldn't worry to much about that. Having said all >> that Social Networking sites (Facebook/MySpace) are becoming targets for >> these kind of attacks nowadays. >> >> If you are worried about people copy and pasting from your website, unless >> its copyrighted material, again don't worry too much, Id take that as a >> compliment, the majority of people who *do* copy and paste tend to be just >> learning. If its for anything else the stuff they are copying will never do >> them any good as far as search engines are concerned because *you* published >> the data *first*. Some search engines (google) will actually remove pages >> that contain duplicate content from their listings, or it will bury the >> duplicate content so deep in their listings that there is no way anyone will >> ever see it anyway. >> >> The rule of thumb concerning microformats is, If you use microformats on >> your website you can expect your data to be shared, crawled and Indexed by >> practically anything that can consume microformats, If you don't want this >> to happen, say because your data is private or something sensitive, then >> don't use microformats. Dont let that last part put you off though, sharing >> your data, particularly events and contact details, *is* a good thing. >> >> Hope all that helps rest your mind a little. >> >> Best wishes >> >> -- >> Martin McEvoy >> >> -- Martin McEvoy