Change Log#
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
[Unreleased]#
[1.7.0] - 2024-04-14#
Removed#
Deprecated feature document parser mednlp_combine_medfirst_doc_parser from resource library.
Changed#
Renamed MedicalFeatureDocumentParser to MedCatFeatureDocumentParser since the parser only adds MedCAT CUIs.
Fixed the configured parser that adds both ScispaCy biomed entities and MedCAT CUIs in mednlp_combine_biomed_doc_parser.
Separated combiner from non-combiner models in the configuration space. The non-combiners (biomed ScispaCy and MedCAT) are parsers that are used in their respective combiners. These are then used by a new composite parser that uses both (mednlp_combine_biomed_doc_parser).
Unit test case added for all configured parsers.
[1.6.0] - 2024-02-27#
Added#
Added a ScispaCy biomedical document parser, which is enabled by setting mednlp_default:doc_parser = mednlp_combine_biomed_doc_parser. This uses better linguistic features and detects more entity spans.
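As a rough illustration of the setting above, the following sketch uses Python's standard configparser module to read an INI-style override. It assumes that the section:option notation maps to an INI section named mednlp_default with a doc_parser option. The helper read_doc_parser is hypothetical, and the application's own configuration loading is not shown and may differ.

```python
import configparser
import io

# Assumption: "mednlp_default:doc_parser" denotes the section "mednlp_default"
# and the option "doc_parser" in an INI-style configuration file.
OVERRIDE = """\
[mednlp_default]
doc_parser = mednlp_combine_biomed_doc_parser
"""


def read_doc_parser(ini_text: str) -> str:
    """Return the configured document parser name from INI text.

    Hypothetical helper for illustration only; not part of the library.
    """
    # Interpolation is disabled so that values are returned verbatim.
    config = configparser.ConfigParser(interpolation=None)
    config.read_file(io.StringIO(ini_text))
    return config.get("mednlp_default", "doc_parser")


if __name__ == "__main__":
    print(read_doc_parser(OVERRIDE))
```

Saving the contents of OVERRIDE to a file would presumably give an override that the application can load, but how that file is passed to the application is outside the scope of this sketch.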
Changed#
Fixed normalized token and sentence/token indexes being clobbered by combined medical parsers.
Fixed the numeric entity feature from the combined medical parser so that it is non-zero for medical entities.
Combined medical parser defines linguistic and medical features in the document parser object instance.
[1.5.0] - 2023-12-05#
Changed#
Upgrade libraries: numpy, lxml, scikit-learn, scipy, medcat.
Added scispacy dependency.
Added#
Support for Python 3.11.
Removed#
Support for Python 3.9.
[1.4.1] - 2023-09-08#
Changed#
Fix unit tests using the model provided for the MedCAT tutorials.
Re-enable GitHub workflow unit tests CI.
[1.4.0] - 2023-08-16#
Downstream moderate risk update release.
Changed#
Switch order of medical specific and general spaCy language parsing and chunking. Now the medical parser is the source parser and the default spaCy parser is the target in MappingCombinerFeatureDocumentParser. This was done to get better sentence chunking, as MedCAT (used in the medical parser) does not sentence chunk well because it was not designed for it.
Upgrade to zensols.util 1.13.0.
Fix cui2vec weight archive re-download on each access.
Mapping combiner default is to use token’s character absolute index.
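The idea of aligning two parses by each token's absolute character index can be sketched with a toy example. This is not the implementation of MappingCombinerFeatureDocumentParser: the Token class, the tokenize and combine functions, and the direction in which features are copied are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Token:
    """Hypothetical token: text, absolute character offset and features."""
    text: str
    idx: int  # absolute character index of the token start in the document
    feats: Dict[str, str] = field(default_factory=dict)


def tokenize(text: str) -> List[Token]:
    """Whitespace tokenizer that records absolute character offsets."""
    tokens: List[Token] = []
    pos = 0
    for word in text.split():
        start = text.index(word, pos)
        tokens.append(Token(word, start))
        pos = start + len(word)
    return tokens


def combine(source: List[Token], target: List[Token]) -> List[Token]:
    """Copy features from source tokens onto the target tokens that start
    at the same absolute character index (assumed direction).

    The target keeps its own tokenization; unmatched tokens are unchanged.
    """
    by_idx = {tok.idx: tok for tok in source}
    for tok in target:
        match = by_idx.get(tok.idx)
        if match is not None:
            tok.feats.update(match.feats)
    return target


if __name__ == "__main__":
    text = "Patient has diabetes"
    source = tokenize(text)
    # Placeholder value standing in for a medical concept identifier.
    source[2].feats["cui"] = "<cui>"
    for tok in combine(source, tokenize(text)):
        print(tok.idx, tok.text, tok.feats)
```

Keying on the character offset rather than on the token's position in its sentence means the two parses can disagree about sentence boundaries and still be aligned, which matches the motivation given above.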
[1.3.2] - 2023-06-29#
Added#
Feature document parser shortcut from application factory.
Changed#
Resource library configuration to clean up model resources after download.
[1.3.1] - 2023-06-27#
Changed#
Updated zensols.nlp to 1.7.2 and use its new feature to automatically load any missing spaCy base model(s).
Remove configuration file requirement from the CLI.
Fix spaCy dependency requirement to align with scipy and zensols.nlp.
[1.3.0] - 2023-06-20#
Changed#
Fix contraction tokenization. This was done by swapping the target and source parser in the lang.conf:mednlp_combine_doc_parser resource library configuration.
[1.2.0] - 2023-06-09#
Changed#
Upgrade to medcat 1.7.0.
Better UTS error handling with raised exceptions and messages.
[1.1.0] - 2023-04-05#
Changed#
Install missing models and packages on start up.
Updated zensols.install to 0.2.1.
Updated zensols.nlp to 1.6.0.
[1.0.0] - 2023-02-02#
Changed#
Updated zensols.util to 1.12.1.
Updated zensols.nlp to 1.5.0.
[0.1.1] - 2022-10-02#
Removed#
Make scispacy optional and remove the dependency. See the test cases in test.
[0.1.0] - 2022-10-01#
Added#
Unit test in GitHub CI.
Changed#
Upgrade to spaCy 2.2, MedCAT 3.0, zensols.nlp 1.4.0.
Make cui2vec a standard word embedding with vectorizer and layer.
Replace entity splitter by not embedding named entities as default configuration.
Removed#
Support for Python 3.7 and 3.8, following dropped support in zensols.util.
[0.0.2] - 2022-05-04#
Added#
A CSV features dump example.
Changed#
Use token instead of sentence level mapping in cases where MedCAT creates unaligned sentence boundaries.
Make medical parser stand-alone and use delegate mapping combiner instead of using a class hierarchy.
Entity linker is now a token decorator.
Protect against unk (unknown) keys in TUIs.
[0.0.1] - 2022-01-30#
Added#
Initial version.
[Unreleased]: https://github.com/Paul Landes/mednlp/compare/v1.7.0...HEAD
[1.7.0]: https://github.com/Paul Landes/mednlp/compare/v1.6.0...v1.7.0
[1.6.0]: https://github.com/Paul Landes/mednlp/compare/v1.5.0...v1.6.0
[1.5.0]: https://github.com/Paul Landes/mednlp/compare/v1.4.1...v1.5.0
[1.4.1]: https://github.com/Paul Landes/mednlp/compare/v1.4.0...v1.4.1
[1.4.0]: https://github.com/Paul Landes/mednlp/compare/v1.3.2...v1.4.0
[1.3.2]: https://github.com/Paul Landes/mednlp/compare/v1.3.1...v1.3.2
[1.3.1]: https://github.com/Paul Landes/mednlp/compare/v1.3.0...v1.3.1
[1.3.0]: https://github.com/Paul Landes/mednlp/compare/v1.2.0...v1.3.0
[1.2.0]: https://github.com/Paul Landes/mednlp/compare/v1.1.0...v1.2.0
[1.1.0]: https://github.com/Paul Landes/mednlp/compare/v1.0.0...v1.1.0
[1.0.0]: https://github.com/Paul Landes/mednlp/compare/v0.1.1...v1.0.0
[0.1.1]: https://github.com/Paul Landes/mednlp/compare/v0.1.0...v0.1.1
[0.1.0]: https://github.com/Paul Landes/mednlp/compare/v0.0.2...v0.1.0
[0.0.2]: https://github.com/Paul Landes/mednlp/compare/v0.0.1...v0.0.2
[0.0.1]: https://github.com/Paul Landes/mednlp/compare/v0.0.0...v0.0.1