7 questions
1) I have the following HTML in a post:
<time class="entry-date published" datetime="2015-12-22T23:52:37+00:00">December 22, 2015</time>
I can use the ginner to get “December 22, 2015”, but that is not what I want. I want the value of the datetime attribute, i.e. “2015-12-22T23:52:37+00:00”. How can I get an attribute’s value?
If I could use regular expressions in the replace function, I could get it very easily, like this:
replace|{line}|<time class="entry-date published" datetime="(.*)">.*</time>|$1|
Please consider allowing regular expressions in the replace function. It would be ***very, very*** useful and powerful.
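To make the request concrete, here is a minimal Python sketch of the capture-group extraction described above. It runs outside obGrabber and says nothing about how the plugin itself works; the function name `datetime_attr` is hypothetical, and the pattern is a slightly stricter variant of the one in the post.

```python
import re

# Sample markup copied from the post above.
html = ('<time class="entry-date published" '
        'datetime="2015-12-22T23:52:37+00:00">December 22, 2015</time>')

# [^"]* stops at the closing quote of the attribute, so the capture cannot
# run on into later attributes the way a greedy (.*) could on a longer line.
pattern = re.compile(r'<time\b[^>]*\bdatetime="([^"]*)"[^>]*>.*?</time>')


def datetime_attr(line):
    """Return the datetime attribute of the first <time> element, or None."""
    match = pattern.search(line)
    return match.group(1) if match else None


print(datetime_attr(html))
```

The capture group plays the role of `$1` in the proposed replace syntax: the whole element is matched, and only the attribute value is kept.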
2) WordPress.com RSS feeds do not use <enclosure>, but they do include the WordPress categories and tags assigned to a post. Both are coded as:
<category><![CDATA[... category name goes here ...]]></category>
Is there any way to get ‘<![CDATA[... category name goes here ...]]>’ from the RSS feed?
Then, once I have that, if I could use regular expressions in the HTML parser replace function, I could extract “... category name goes here ...” very easily, like this:
replace|{line}|<!\[CDATA\[(.*)\]\]>|$1|
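As an illustration of the result being asked for, here is a small Python sketch that pulls category names out of a feed fragment in two ways: with an XML parser, which unwraps CDATA on its own, and with a regular expression like the one proposed above. The feed fragment, the category names, and the function names are made up for the example, and none of this reflects obGrabber's internals.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical feed fragment in the shape described above.
rss = """<rss version="2.0"><channel><item>
<title>Example post</title>
<category><![CDATA[News]]></category>
<category><![CDATA[Site updates]]></category>
</item></channel></rss>"""


def categories_via_parser(feed_xml):
    """Return one list of category names per <item>; CDATA is already unwrapped."""
    root = ET.fromstring(feed_xml)
    return [[c.text for c in item.findall("category")]
            for item in root.iter("item")]


def categories_via_regex(feed_xml):
    """Return every CDATA payload; [ and ] are escaped because they are regex metacharacters."""
    return re.findall(r"<!\[CDATA\[(.*?)\]\]>", feed_xml)


print(categories_via_parser(rss))
print(categories_via_regex(rss))
```

The non-greedy `(.*?)` matters when several CDATA sections sit on one line; a greedy `(.*)` would swallow everything up to the last `]]>`.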
3) I’m still confused about the meta processor. Can you give an example of how it is used and explain exactly what information it is used for?
4) The same goes for the metakey destination. Can you give an example of where its information goes in the post?
5) I still don’t understand what “fix time” is. I read the documentation here:
https://docs.google.com/document/d/1DOwgYcMJnzFdybhQN7n1CdZVK1bSa2ZHqbO1CumpjUs/edit?pref=2&pli=1#
And it says:
“This Option will set a lag time between time article being published in the source and the time outputted by obGrabber RSS Reader Engine. By default it is set to -24, meaning the outputted time will be 24 hours earlier than the publish time of your original article.”
But I don’t understand what this means. Can you give an exact example with times?
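Read literally, the quoted sentence describes a simple offset in hours. The Python sketch below shows that reading with the timestamp from question 1; it is only an interpretation of the documentation text, the name `apply_fix_time` is hypothetical, and whether obGrabber actually behaves this way is exactly what is being asked.

```python
from datetime import datetime, timedelta


def apply_fix_time(published, fix_time_hours=-24):
    """Shift a publish time by the configured number of hours (assumed meaning of "fix time")."""
    return published + timedelta(hours=fix_time_hours)


# Publish time taken from the <time> element in question 1.
published = datetime.fromisoformat("2015-12-22T23:52:37+00:00")

print("source:", published.isoformat())
print("output:", apply_fix_time(published).isoformat())
```

Under this reading, an article published at 23:52:37 on December 22, 2015 would be output with a time of 23:52:37 on December 21, 2015.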
6) Also, can you explain the cache? It is unclear what is being cached and where.
7) When I use the ginner on the demo HTML parser to get inside an <a> tag, it still returns the full <a> tag, as if it had not been ginned. This looks like a bug. Also, how do I specify an <a> tag that has no class or id?
Thx