Releases: spencermountain/wtf_wikipedia
8.1.2
track changes to covid templates
8.1.1
bugfix for table parser
8.1.0
- [major] fix Link json object in `.json()` result
- [major] fix inconsistent response for singular method aliases like `.template('foo')`
- [major] change in rowspan behaviour to support covid table
- support `<noinclude>`
- add `.url()` and `.language()` methods
  - support setters on Link methods
- add `Link.href()` method
  - support proper urls for interwiki links
- replicate wikipedia behaviour for apostrophe-s after link
- new plugins summary, classify, category, and i18n.
- Link hrefs are not titlecased anymore by default
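
To illustrate what rowspan handling involves, here is a minimal, self-contained sketch that expands cells spanning several rows into a rectangular grid. It is not the library's table parser: the `expandRowspans` function and its `{ text, rowspan }` cell shape are hypothetical, chosen only for illustration.

```javascript
// Hypothetical helper: NOT part of wtf_wikipedia's API.
// Each input cell is { text, rowspan }; rowspan defaults to 1.
function expandRowspans(rows) {
  const grid = []
  // pending[col] = { text, left }: a cell carried down from an earlier row
  const pending = []
  // is there still a carried-down cell at or after this column?
  const hasCarry = (from) => pending.some((p, i) => i >= from && p.left > 0)

  rows.forEach((row) => {
    const cells = row.slice()
    const out = []
    let col = 0
    while (cells.length > 0 || hasCarry(col)) {
      const carried = pending[col]
      if (carried && carried.left > 0) {
        // this column is occupied by a cell from a previous row
        out.push(carried.text)
        carried.left -= 1
      } else if (cells.length > 0) {
        const cell = cells.shift()
        out.push(cell.text)
        const span = cell.rowspan || 1
        if (span > 1) {
          pending[col] = { text: cell.text, left: span - 1 }
        }
      } else {
        // gap before a later carried-down cell
        out.push('')
      }
      col += 1
    }
    grid.push(out)
  })
  return grid
}

const provinceRows = [
  [{ text: 'Ontario', rowspan: 2 }, { text: 'Toronto' }],
  [{ text: 'Ottawa' }],
]
console.log(expandRowspans(provinceRows))
// [ [ 'Ontario', 'Toronto' ], [ 'Ontario', 'Ottawa' ] ]
```

Repeating the spanned value in every row it covers keeps each row the same width, which makes per-row key-value output straightforward.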
8.0.0
- [breaking] move `.html()`, `.latex()`, and `.markdown()` to their respective plugins
  - drop header/footer boilerplate from outputs
- [breaking] `.templates()` and `.links()` return Template and Link objects, and not bare JSON (use `.map(l => l.json())`)
- [breaking] refactor inputs for `.fetch()`
  - no longer support 'enwikiquote' etc format as input
  - use 'wiki' instead of undocumented 'wikiUrl' param
  - no more automatic throttling/rate-limiting
- [breaking] move `Image.exists()` method to plugin
- [major] create separate client/server-side build formats (use native fetch/node lib)
- [major] support deep (infinite) recursion in templates
- [major] much-stronger i18n support
- no-longer automatically titlecase links
- support adding template parsers through plugins in `.extend()`
  - support array, number, and string shorthand for template parsers
- deprecate `.plaintext()` in favour of `.text()`
7.2.10
7.1.0
- some template fixes
- add a 'number' field in sentence json, when it looks like a number
- slight change in coordinate result format, support inline coordinate text
- handle fetching a large list of titles in sequence
7.1.1
- support population, weatherbox templates
7.2.0
- improved date templates, bugfixes
7.2.9
- a few more sports templates
- rowspan parsing fix
- no-longer include package.json in builds
- use full template-parser for image captions
- support manually setting doc.title()
7.2.10
- improved unicode support for sentence/paragraph splitting
- supporting more formatting templates, like Mono
- more flexible reference support in `.json()`
7.0.0
6.0.0 🚨
- support `.paragraphs()`
- ⚠️ major changes to output of `.json()`. cleaning-up redundant data. ⚠️
  - remove top-level `templates` data (found in `section`) - resume it with `{templates:true}`
  - remove top-level `coordinates` data (found in `templates`) - resume it with `{coordinates:true}`
  - remove top-level `citations` data (found in `section`) - resume it with `{citations:true}`
- return empty arrays in `.json()` again ¯_(:/)_ /¯
- remove title on html output
- change ambiguous `options.title` for sections to `options.headers`
- support lists of 1
- begin removing empty references section by default
- begin support for rendering citations at the bottom of documents
- begin first-class references-parsing as objects at paragraph-level
  - use this: `.citations()` --> `.citations().map(c => c.json());`
- remove `.wikitext()` and `.reparse()` methods - keeping wikitext stateful caused too many issues
- turn `Image.file` into a function
- include `interwiki()` results in `.links()`
- support `follow_redirects` option to fetch
- hide object data in console.logs
- move ALL image urls from `upload.wikimedia.org/wikipedia/commons` to `wikipedia.org/wiki/Special:Redirect/file/` via #86
- image captions are now Sentence objects
- rename citation → reference internally, and in json output
- remove references inside section titles
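
The image-url change in this release can be pictured with a small sketch. The `redirectUrl` helper below is hypothetical (it is not a library function); it only shows the shape of a `Special:Redirect/file/` url, assuming an `https://` scheme and that spaces in file names are written as underscores.

```javascript
// Hypothetical helper, for illustration only (not wtf_wikipedia's API).
function redirectUrl(file) {
  // drop an optional 'File:' or 'Image:' namespace prefix
  const name = file.replace(/^(file|image):/i, '').trim()
  // MediaWiki writes spaces in titles as underscores
  const title = name.replace(/ /g, '_')
  return 'https://wikipedia.org/wiki/Special:Redirect/file/' + encodeURIComponent(title)
}

console.log(redirectUrl('File:Toronto skyline.jpg'))
// https://wikipedia.org/wiki/Special:Redirect/file/Toronto_skyline.jpg
```

One property of the redirect form is that it needs only the file name, whereas direct `upload.wikimedia.org` paths also contain hash-derived directory segments.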
6.1.0
- titlecase internal link destinations #192
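
As a rough picture of what titlecasing a link destination means, the sketch below uppercases only the first character of a page name, which is how Wikipedia normalises article titles. The `titlecase` helper is a hypothetical stand-in, and treating "titlecase" as first-character capitalisation is an assumption, not a description of the library's code.

```javascript
// Hypothetical helper: assumes 'titlecase' means capitalising the first character only.
function titlecase(page) {
  if (!page) {
    return page
  }
  return page.charAt(0).toUpperCase() + page.slice(1)
}

console.log(titlecase('toronto raptors'))
// Toronto raptors
```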
6.2.0
- support categories in redirects
- add mongo-encoding from dumpster-dive
6.3.0
- support way (+20%?) more templates.
7.0.0
- change result-format in a lot of templates, for more consistency.
  - notably: reference format, see also, IPA, main
- support colspan/rowspan in tables (a little!)
- support implicit first-row headers for some tables
- return templates even if they have no data
- begin support for some well-used `{{foo start}}...{{foo end}}` templates
- remove empty `[]` for some more section properties in `.json()` response
5.3.1
last stable release before v6
from changelog:
5.1.0
improved support for gallery tag
more support for wiktionary grammar templates
tweak some regexes
5.2.0
make .json() results return proper json for tables
5.3.0
add infobox html back into html output (tentative)
redirect support in .json(), .html() output
remove empty [] properties in .json() results (saves disk space!)
keep # anchor data in .links()
show links default-on in latex output, like in md and html
render html/latex/json 'soft redirect', instead of blank pages
5.0.0
3.0.0
- move `.parse()` to main `wtf()` method
- allow repeated processes without a pre-parse of the document
- wtf.fetch() uses promises, and native `fetch()` method (when available)
- allow per-section images, lists, tables + templates
- section depth values now start at 0
- infobox values now return sentence objects
- latex output (thanks @niebert!)
- refactor shell scripts to `wtf_wikipedia Toronto --plaintext`
- use babel-preset-env cause it's new-new
- update deps
3.1.0
- improved .json() results
- guess a page's title based on bold formatting in first sentence
- make section.title a function
4.0.0
- 🚨 non-api changing, but large result-format change
- add `.wikitext()` method to Document, Section, Sentence (thanks @niebert)
- move infobox, citation parser/data to Section class
- `.templates()` are now an ordered array, instead of an object, and include infoboxes and citations
- add (early) support for 'generic' key-value template parsing
- normalize/lowercase template/infobox properties
- add loose `.get('key')` method to Infobox class
- mess-around with citation-template formatting
- beginning to support unknown template forms
- move `date` data from Sentence to Section object.
- rollback of awkward+undocumented `options` param in parser (but keep options param for output methods)
- add support for about a hundred new templates
- templates, including citations, try to be flat-text, and no-longer return Sentence objects
4.1.0
- remove repeated/redundant text in `.links()` results
- don't automatically titlecase link srcs anymore
4.2.0
- return a result or undefined for `sentences.bolds(0)`, and the like
4.2.2
- support dollar templates
4.5.0
- support `section(0).wikitext()`
- support inline {{marriage}} template
- dangling semi-colons in first-sentence parentheses
4.6.0
- `<gallery>` tag support in `.images()`
- support pageids again in .fetch()
- better disambiguation-page detection in english
- remove wikitext from caption titles
- support 3-level templates (whew!)
5.0.0
- new `Table` and `List` classes
- improved table-parser
- generate name `col1` instead of `col-0`
- support `options.verbose_template` for debugging
- support recursive tables
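
The generated column names can be pictured with a short sketch of turning table rows into key-value objects. The `rowsToObjects` helper is hypothetical and is not the library's table parser; it assumes generated names are 1-based (`col1`, `col2`, ...) and are used only where a header is missing.

```javascript
// Hypothetical helper, for illustration only (not wtf_wikipedia's API).
function rowsToObjects(rows, headers = []) {
  return rows.map((row) => {
    const obj = {}
    row.forEach((value, i) => {
      // use the header when there is one, else a generated 1-based name
      const key = headers[i] || 'col' + (i + 1)
      obj[key] = value
    })
    return obj
  })
}

console.log(rowsToObjects([['Toronto', 'Canada']], ['city']))
// [ { city: 'Toronto', col2: 'Canada' } ]
```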
1.0.0
breaking change with 0x, sections are now formatted as an array of objects, with depth information.
tables are parsed into an array of key-value pairs.
options object is removed.
all is refactored