Microformats 2
Jessica Lynn Suttles
A Ruby gem for parsing HTML documents containing microformats.
Before installing and using microformats-ruby, you'll want to have Ruby 2.4.10 (or newer) installed. It's recommended that you use a Ruby version management tool like rbenv, chruby, or rvm.
microformats-ruby is developed using Ruby 2.7.1 and is additionally tested against versions 2.4, 2.5, 2.6, 2.7, 3.0, and 3.1 using github Actions.
If you're using Bundler to manage gem dependencies, add microformats-ruby to your project's Gemfile:
source 'https://rubygems.org'
gem 'microformats', '~> 4.0', '>= 4.2.1'
…and then run:
You may also install microformats-ruby directly using:
An example working with a basic h-card:
source = '
Jessica Lynn Suttles
collection.to_hash
collection.to_json
collection.card.name #=> "Jessica Lynn Suttles"
Below is a more complex markup structure using an h-entry with a nested h-card:
source = '
Jessica Lynn Suttles
collection = Microformats.parse(source)
collection.entry.name.to_s #=> "Microformats 2"
collection.entry.author.name.to_s #=> "Jessica Lynn Suttles"
collection.entry.author.name #=> "Jessica Lynn Suttles" collection.entry.properties.author.properties.name.to_s #=> "Jessica Lynn Suttles"
_
instead of -
to return property valuescollection.entry.author.first_name #=> "Jessica" collection.rel_urls #=> {}
Using the same markup patterns as above, here's an h-entry with multiple authors, each marked up as h-cards:
source = '
Jessica Lynn Suttles
Brandon Edens
collection = Microformats.parse(source)
collection.entry.author.name #=> "Jessica Lynn Suttles" collection.entry.author(1).name #=> "Brandon Edens"
:all
collection.entry.author(:all).count #=> 2 collection.entry.author(:all)[1].name #=> "Brandon Edens"
microformats-ruby also includes a command like program that will parse HTML and return a JSON representation of the included microformats.
microformats http://tantek.com
The program accepts URLs, file paths, or strings of HTML as an argument. Additionally, the script accepts piped input from other programs:
curl http://tantek.com | microformats
Status | Specification or Parsing Rule |
---|---|
✅ | Parse a document for microformats |
✅ | Parsing a p- property |
✅ | Parsing a u- property |
✅ | Parsing a dt- property |
✅ | Parsing an e- property |
✅ | Parsing for implied properties |
✅ | Nested properties |
✅ | Nested microformat with associated property |
✅ | Nested microformat without associated property |
✅ | Recognize dynamically created properties |
✅ | Support for rel attribute values |
✅ | Normalizing u-* property values |
✅ | Parse the value class pattern |
✅ | Recognize vendor extensions |
✅ | Support for classic microformats |
❌ | Recognize the include pattern |
Have questions about using microformats-ruby? Found a bug? Have ideas for new or improved features? Want to pitch in and write some code?
Check out CONTRIBUTING.md for more on how you can help!
The microformats-ruby logo is derived from the microformats logo mark by Rémi Prévost.
microformats-ruby is written and maintained by:
microformats-ruby is dedicated to the public domain using the Creative Commons CC0 1.0 Universal license.
The authors waive all of their rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law. You can copy, modify, and distribute the work, even for commercial purposes, all without asking permission.
See LICENSE for more details.