archaeo_super_prompt.modeling.entity_extractor.ner_transformer
source module archaeo_super_prompt.modeling.entity_extractor.ner_transformer
The pipeline Transformer related to the remote NER model.
Classes
-
NerModel — Transformer adding identified NamedRecognition features for each chunk.
source class NerModel(allowed_ner_confidence=0.7)
Bases : BaseTransformer
Transformer adding identified NamedRecognition features for each chunk.
Instantiate the Named Entity Recognition model.
Environment variables
The NER_MODEL_HOST_URL env var must be set with the base url of the remote model for the named entity recognition (e.g. 'http://localhost:8004')
Methods
source method NerModel.transform(X: PDFChunkDataset) → DataFrame[EntitiesPerChunkSchema]