Releases: tidyverse/rvest
Releases · tidyverse/rvest
rvest 0.3.1
- Fix invalid link for SSA example.
- Parse
<options>
that don't have value attribute (#85). - Remove all remaining uses of
html()
in favor ofread_html()
(@jimhester,
#113).
rvest 0.3.0
- rvest has been rewritten to take advantage of the new xml2 package. xml2
provides a fresh binding to libxml2, avoiding many of the work-arounds
previously needed for the XML package. Now rvest depends on the xml2
package, so all the xml functions are available, and rvest adds a thin
wrapper for html. - A number of functions have change names. The old versions still work,
but are deprecated and will be removed in rvest 0.4.0.html_tag()
->html_name()
html()
->read_html()
html_node()
now throws an error if there are no matches, and a warning
if there's more than one match. I think this should make it more likely to
fail clearly when the structure of the page changes.xml_structure()
has been moved to xml2. Newhtml_structure()
(also in
xml2) highlights id and class attributes (#78).submit_form()
now works with forms that use GET (#66).submit_request()
(and hencesubmit_form()
) is now case-insensitive,
and so will find<input type=SUBMIT>
as well as<input type="submit">
.submit_request()
(and hencesubmit_form()
) recognizes forms with
<input type="image">
as a valid form submission button per
http://www.w3.org/TR/html-markup/input.image.html
rvest 0.2.0
New features
html()
andxml()
pass...
on tohttr::GET()
so you can more
finely control the request (#48).- Add xml support: parse with
xml()
, then work with usingxml_node()
,
xml_attr()
,xml_attrs()
,xml_text()
andxml_tag()
(#24). xml_structure()
: new function that displays the structure (i.e. tag
and attribute names) of a xml/html object (#10).
Bug fixes
follow_link()
now accepts css and xpath selectors. (#38, #41, #42)html()
does a better job of dealing with encodings (passing the
problem on toXML::parseHTML()
) instead of trying to do it itself
(#25, #50).html_attr()
returns default value when input is NULL (#49)- Add missing
html_node()
method for session. html_nodes()
now returns an empty list if no elements are found (#31).submit_form()
converts relative paths to absolute URLs (#52).
It also deals better with 0-length inputs (#29).
rvest 0.1.0
Initial release