-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore(deps): update dependency spacy to v3.8.4 #15
base: main
Are you sure you want to change the base?
Conversation
03f9a4c
to
13d7b77
Compare
13d7b77
to
c1827b3
Compare
c1827b3
to
d12d8c6
Compare
d12d8c6
to
1b3f347
Compare
1b3f347
to
f85a417
Compare
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
6 similar comments
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
f85a417
to
1519962
Compare
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
5 similar comments
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
1519962
to
3e0073e
Compare
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
2 similar comments
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
Thanks for making updates to your pull request. Our team will take a look and provide feedback as soon as possible. Please wait for any GitHub Actions to complete before editing your pull request. If you have any additional questions or concerns, feel free to let us know. Thank you for your contributions! |
5469ac7
to
555a063
Compare
555a063
to
4686a5d
Compare
4686a5d
to
a2855e3
Compare
a2855e3
to
1008480
Compare
1008480
to
8ef3ca8
Compare
8ef3ca8
to
c488900
Compare
c488900
to
6cce1f3
Compare
6cce1f3
to
fdf7de7
Compare
fdf7de7
to
9531c81
Compare
9531c81
to
0319160
Compare
0319160
to
c5ecd53
Compare
c5ecd53
to
2cf929f
Compare
2cf929f
to
4b9939a
Compare
4b9939a
to
d8c3a9c
Compare
This PR contains the following updates:
== 3.0.6
->==3.8.4
Release Notes
explosion/spaCy (spacy)
v3.8.4
Compare Source
v3.8.3
Compare Source
v3.8.2
Compare Source
v3.8.1
Compare Source
v3.8.0
Compare Source
v3.7.6
Compare Source
v3.7.5
: : Download sanitization, Typer compatibility, and a bugfix for linking gold entitiesCompare Source
✨ New features and improvements
spacy download
(#13313).typing-extensions<5.0.0
for Python < 3.8 (#13516).🔴 Bug fixes
use_gold_ents
behaviour for EntityLinker.📖 Documentation and examples
MorphAnalysis
(#13433).👥 Contributors
@danieldk, @honnibal, @ines, @JoeSchiff, @nokados, @Paillat-dev, @rmitsch, @schorfma, @strickvl, @svlandeg, @ynx0
v3.7.4
: : New textcat layers and fo/nn language extensionsCompare Source
✨ New features and improvements
TextCatReduce.v1
layer for text classification (#13181).TextCatParametricAttention.v1
layer for text classification (#13201).build
module for creating model packages by default (#13109).benchmark speed
command (#13247).🔴 Bug fixes
Language.pipe
.Doc
.Tokenizer.explain
for special cases with whitespace.SparseLinear
layer.📖 Documentation and examples
trf_data
examples and the transformer pipeline design section.👥 Contributors
@adrianeboyd, @danieldk, @evornov, @honnibal, @ines, @lise-brinck, @ridge-kimani, @rmitsch, @shadeMe, @svlandeg
v3.7.3
Compare Source
v3.7.2
: : Fixes for APIs and requirementsCompare Source
✨ New features and improvements
__all__
fields (#13063).🔴 Bug fixes
spacy.cli.project
API.Any
comparisons forToken
andSpan
.📖 Documentation and examples
spacy-llm
including Azure OpenAI, PaLM, and Mistral support.👥 Contributors
@adrianeboyd, @honnibal, @ines, @rmitsch, @svlandeg
v3.7.1
: : Bug fix for spacy.cli module loadingCompare Source
🔴 Bug fixes
spacy.info
to fix availability ofspacy.cli
followingimport spacy
(#13040).👥 Contributors
@adrianeboyd, @honnibal, @ines, @svlandeg
v3.7.0
: : Trained pipelines using Curated Transformers and support for Python 3.12Compare Source
This release drops support for Python 3.6 and adds support for Python 3.12.
✨ New features and improvements
spacy project
commands should run as before, just now they're using Weasel under the hood.transformers
extra tospacy-transformers
v1.3 (#13025).--spans-key
option for CLI evaluation withspacy benchmark accuracy
(#12981).spacy.info
(#12962).spacy.training.example
(#12801).Language.replace_listeners
: Pass the replaced listener and thetok2vec
pipe to the callback in order to supportspacy-curated-transformers
(#12785).tqdm
withdisable=None
to disable output in non-interactive environments (#12979).📦 Trained pipelines updates
The transformer-based
trf
pipelines have been updated to use our new Curated Transformers library through the Thinc model wrappers and pipeline component from spaCy Curated Transformers.ray
extra.spacy project
has a few backwards incompatibilities due to the transition to the standalone library Weasel, which is not as tightly coupled to spaCy. Weasel produces warnings when it detects older spaCy-specific settings in your environment or project config.spacy_version
configuration key has been dropped.check_requirements
configuration key has been dropped due to the deprecation ofpkg_resources
.SPACY_CONFIG_OVERRIDES
environment variable is no longer checked. You can set configuration overrides usingWEASEL_CONFIG_OVERRIDES
.SPACY_PROJECT_USE_GIT_VERSION
environment variable has been dropped.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @bdura, @connorbrinton, @danieldk, @davidberenstein1957, @denizcodeyaa, @eltociear, @evornov, @honnibal, @ines, @jmyerston, @koaning, @magdaaniol, @pdhall99, @ringohoffman, @rmitsch, @senisioi, @shadeMe, @svlandeg, @vinbo8, @wjbmattingly
v3.6.1
: : Support for Pydantic v2, find-function CLI and moreCompare Source
✨ New features and improvements
find-function
CLI for finding locations of registered functions (#12757).spacy[cuda12x]
forcupy-cuda12x
(#12890).init config
andtrain
CLI (#12173).distutils
tosetuptools
/sysconfig
(#12853).🔴 Bug fixes
<br>
tags in displaCy.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @afriedman412, @arplusman, @bdura, @connorbrinton, @honnibal, @ines, @it176131, @pmbaumgartner, @rmitsch, @shadeMe, @svlandeg, @thomashacker, @victorialslocum, @x-tabdeveloping
v3.6.0
: : New span finder component and pipelines for SlovenianCompare Source
✨ New features and improvements
span_finder
pipeline component to identify overlapping, unlabeled spans (#12507).spacy evaluate --per-component
,Language.evaluate(per_component=True)
andScorer.score(per_component=True)
(#12540).spancat_singlelabel
inspacy debug data
CLI (#12749).PhraseMatcher
andSpanGroup
(#12642, #12714).🔴 Bug fixes
SpanGroup
spans come from the current doc.📦 Trained pipelines updates
We have added new pipelines for Slovenian that use the trainable lemmatizer and floret vectors.
sl_core_news_sm
sl_core_news_md
sl_core_news_lg
sl_core_news_trf
The English pipelines have been updated to improve handling of contractions with various apostrophes and to lemmatize "get" as a passive auxiliary.
The Danish pipeline
da_core_news_trf
has been updated to usevesteinn/DanskBERT
with performance improvements across the board.SpanGroup
spans are now required to be from the same doc. When initializing aSpanGroup
, there is a new check to verify that all added spans refer to the current doc. Without this check, it was possible to run into string store or other errors.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @bdura, @danieldk, @davidberenstein1957, @diyclassics, @essenmitsosse, @honnibal, @ines, @isabelizimm, @jmyerston, @kadarakos, @KennethEnevoldsen, @khursani8, @ljvmiranda921, @rmitsch, @shadeMe, @svlandeg, @tomaarsen, @victorialslocum, @vin-ivar, @ZiadAmerr
v3.5.4
: : Bug fixes for overrides with registered functions and sourced components with listenersCompare Source
✨ New features and improvements
🔴 Bug fixes
👥 Contributors
@adrianeboyd, @bdura, @honnibal, @ines, @svlandeg
v3.5.3
: : Speed improvements, bug fixes and moreCompare Source
✨ New features and improvements
spancat
, in particular on GPU (~10x-30x faster) (#12577).>+
,>-
,>++
,>--
) for the dependency matcher (#12528).doc.spans
for displaCy output inspacy benchmark accuracy
/spacy evaluate
(#12575).MorphAnalysis.get(default=)
argument for user-provided default values similar todict
(#12545).🔴 Bug fixes
#egg
from download URLs due to future deprecation inpip
.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @andyjessen, @bdura, @davidberenstein1957, @diyclassics, @honnibal, @ines, @kadarakos, @KennethEnevoldsen, @ljvmiranda921, @moxley01, @royashcenazi, @svlandeg, @tanloong, @victorialslocum
v3.5.2
: : Pretraining improvements, bug fixes for spans and spancat and moreCompare Source
✨ New features and improvements
spacy pretrain
(#12435).model-last.bin
forspacy pretrain
(#12459).Span
input fordisplacy.parse_deps
(#12477).cupy
install extras.🔴 Bug fixes
Span.sents
.spancat_singlelabel
.Span.sents
when the final sentence is the last token in aDoc
.Span.kb_id
andSpan.id
strings inDoc
andDocBin
serialization.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @BLKSerene, @honnibal, @ines, @kadarakos, @prajakta-1527, @rmitsch, @shadeMe, @sloev, @svlandeg, @thomashacker, @willfrey
v3.5.1
: : spancat for multi-class labeling, fixes for textcat+transformers and moreCompare Source
💥 We'd love to hear more about your experience with spaCy! Take our survey here.
✨ New features and improvements
spancat_singlelabel
pipeline component for multi-class and non-overlapping span classification. Thespancat_singlelabel
component predicts at most one label for each suggested span and adds a new settingallow_overlap
to restrict the output to non-overlapping spans (#11365).transformer
+ CNN for efficient GPUtextcat
withspacy init config
(#11900).spacy debug data
(#11419).>+
,>-
,<+
,<-
) (#12334).spacy.PlainTextCorpusReader.v1
for plain text input (#12122).alignment_mode
andspan_id
toSpan.char_span()
(#12145, #12196).🔴 Bug fixes
top_k>1
in trainable lemmatizer.test_cli_find_threshold()
test more robust.registry.find()
.Matcher
patterns with extension attributes.grc
to languages with lexeme norms inspacy-lookups-data
.KnowledgeBase
instances configurable.auto_select_port
.InMemoryLookupKB.is_empty
.Lexeme.orth
andLexeme.lower
.PretrainVectors
.pkg_resources
.📖 Documentation and examples
👥 Contributors
@adrianeboyd, @andyjessen, @danieldk, @essenmitsosse, @honnibal, @ines, @itssimon, @kadarakos, @kwhumphreys, @ljvmiranda921, @pmbaumgartner, @polm, @richardpaulhudson, @rmitsch, @shadeMe, @svlandeg, @tanloong, @thomashacker, @victorialslocum
v3.5.0
: : New CLI commands, language updates, bug fixes and much moreCompare Source
✨ New features and improvements
apply
CLI command to annotate new documents with a trained pipeline (#11376).benchmark
CLI command to benchmark pipelines. The newbenchmark speed
subcommand measures the speed of a pipeline, thebenchmark accuracy
subcommand is a new alias forevaluate
(#11902).find-threshold
CLI command to identify an optimal threshold for classification models (#11280).FUZZY
Matcher
operator for fuzzy matches based on Levenshtein edit distance. In addition, theFUZZY
andREGEX
operators are now supported in combination withIN
/NOT_IN
. (#11359).typer
v0.7.x (#11720),mypy
0.990 (#11801) andtyping_extensions
v4.4.x (#12036).spacy.ConsoleLogger.v3
with expanded progress tracking (#11972).textcat
withspacy.textcat_scorer.v2
(#11696 and #11971) andspacy.textcat_multilabel_scorer.v2
(#11820).InMemoryLookupKB
(#11268).before_update
callback that is invoked at the start of each training step (#11739).SpanGroup
(#11380).displacy.serve
when the default port is in use (#11948).tok2vec
version (#11618).🔴 Bug fixes
tok2vec
ortransformer
layer.textcat
.Vocab.to_disk
respects the exclude setting forlookups
andvectors
.SpanGroup
andSpan
objects.The following changes may require you to update code that is using the relevant functionality:
textcat
ortextcat_multilabel
model - ensure that values are 0.0 or 1.0 as explained in the docs.KnowledgeBase
is now an abstract class, you should call the constructor of the newInMemoryLookupKB
instead when you want to use spaCy's default KB implementation. If you've written a custom KB that inherits fromKnowledgeBase
, you'll need to implement its abstract methods, or alternatively inherit fromInMemoryLookupKB
instead.The following changes may influence the output of your language pipeline or trained models:
pymorphy3
(#11345, #11811).tok2vec
defaults in all components (#11618).textcat
andtextcat_multilabel
components (#11698).textcat
andtextcat_multilabel
to fix a bug related tothreshold
fortextcat
and to make it possible to score multipletextcat
/textcat_multilabel
components in a single pipeline with custom scorers. If no custom scorers are used, thecat_p/r/f
scores will now only reflect the final component's labels and performance (#11696, #11820).token_acc
score to report the intended measure (# correct tokens / # predicted tokens
, the same as in spaCy v2). Thetoken_acc
scores for v3.5 will be lower for the same performance because they were incorrectly inflated in v3.0-v3.4. Thetoken_p/r/f
scores should remain unchanged (#12073).The following functionality will be changed in the near future - so it's best to start updating your scripts now to make them more generic:
master
branch tomain
.📦 Trained pipelines updates
IS_SPACE
as atok2vec
feature fortagger
andmorphologizer
components to improve tagging of non-whitespace vs. whitespace tokens.spacy-transformers
v1.2, which uses the exact alignment fromtokenizers
for fast tokenizers instead of the heuristic alignment fromspacy-alignments
. For all trained pipelines exceptja_core_news_trf
, the alignments between spaCy tokens and transformer tokens may be slightly different. More details about thespacy-transformers
changes in the v1.2.0 release notes.📖 Documentation and examples
biluo_to_iob
andiob_to_biluo
functions.👥 Contributors
@aaronzipp, @adrianeboyd, @albertvillanova, @ArchiDevil, @cfuerbachersparks, @damian-romero, @danieldk, @darigovresearch, @DSLituiev, @essenmitsosse, @gremur, @honnibal, @ines, @jmyerston, @JosPolfliet, @kadarakos, @koaning, @kwhumphreys, @ljvmiranda921, @MarcoGorelli, @orglce, @pmbaumgartner, @polm, @richardpaulhudson, @rmitsch, @ryndaniels, @shadeMe, @svlandeg, @thomashacker, @TrellixVulnTeam, @wannaphong, @zhiiw, @zrpxx
v3.4.4
: : Bug fixes and future NumPy compatibilityCompare Source
This bug fix release is primarily to avoid deprecation warnings and future incompatibility with NumPy v1.24+.
🔴 Bug fixes
spancat
for docs with zero suggestions.smart_open
requirement and update deprecated options.spacy init config --gpu
for environments withoutspacy-transformers
.EditTreeLemmatizer
.👥 Contributors
@adrianeboyd, @danieldk, @honnibal, @ines, @polm, @svlandeg
v3.4.3
: : Extended Typer support and bug fixesCompare Source
✨ New features and improvements
🔴 Bug fixes
EntityLinker
.Doc.to_json()
for attributes set by getters.pipeline_package.load()
.spacy project
requirements checks for unsupported specifiers and requirements lines.spacy.load(disable=)
that could enable currently disabled components.👥 Contributors
@aaronzipp, @adrianeboyd, @honnibal, @ines, @polm, @rmitsch, @ryndaniels, @svlandeg, @thomashacker
v3.4.2
: : Latin and Luganda support, Python 3.11 wheels and moreCompare Source
✨ New features and improvements
spacy.ConsoleLogger.v2
optionally saves training logs to JSONL (#11214).DependencyMatcher
to include matching parents or children to the left or the right of the node (#10371).cuda11x
andcuda-autodetect
(usingcupy-wheel
) (#11279).Doc.to_json()
andDoc.from_json()
(#11125).enable
anddisable
options forspacy.load()
more consistent (#11459).disable
/enclude
/exclude
forspacy.load()
(#11406).--url
flag forspacy info
to print the direct download URL for a pipeline (#11175).spacy project
CLI (#11226).spacy debug data
CLI for spancat data (#11504).spacy_version
inspacy package
metadata (#11552).spacy project assets
(#11458).spacy pretrain
command (#11210).natto-py
for theko
extra (#11222).📦 Trained pipelines updates
This release includes updated English pipelines for spaCy v3.4 with improved NER performance. The updates in
en_core_web_*
v3.4.1 address issues related to training from data with partial named entity annotation, which led to lower NER recall in English pipeline versions v3.0.0–v3.4.0. In particular, entities that appear in the sections of the OntoNotes training data without NER annotation were not predicted consistently by the earlier pipeline versions, such as names and places that are frequent in the Biblical sections, e.g., "David" and "Egypt" (see #7493).Use
spacy download
to update your English pipelines to the newest version. If you'd prefer to keep using an earlier version, you can specify the version directly with e.g.spacy download -d en_core_web_sm-3.4.0
. You can check that you are using the new version (v3.4.1) withspacy validate
:🔴 Bug fixes
SetPredicate
.Doc.__init__
.pymorphy2_lookup
lemmatizer mode for Russian and Ukrainian.Doc
type, an error will now be raised (#11424).📖 Documentation and examples
spacy.models_and_pipes_with_nvtx_range.v1
callback.Example
API documentation.displacy
docs.spacy project dvc
.spacy-wordnet
.initialize()
function for pipeline components.👥 Contributors
@adrianeboyd, @bdura, @danieldk, @diyclassics, @DSLituiev, @GabrielePicco, @honnibal, @ines, @JulesBelveze, @kadarakos, @ljvmiranda921, @ninjalu, @pmbaumgartner, @polm, @radandreicristian, @richardpaulhudson, @rmitsch, @shadeMe, @stefawolf, @svlandeg, @thomashacker, @tobiusaolo, @tzussman , @yasufumy
[
v3.4.1
](https:/Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.