Skip to content

archaeo_super_prompt.modeling.entity_extractor.ner_transformer

source module archaeo_super_prompt.modeling.entity_extractor.ner_transformer

The pipeline Transformer related to the remote NER model.

Classes

  • NerModel Transformer adding identified NamedRecognition features for each chunk.

source class NerModel(allowed_ner_confidence=0.7)

Bases : BaseTransformer

Transformer adding identified NamedRecognition features for each chunk.

Instantiate the Named Entity Recognition model.

Environment variables

The NER_MODEL_HOST_URL env var must be set with the base url of the remote model for the named entity recognition (e.g. 'http://localhost:8004')

Methods

source method NerModel.transform(X: PDFChunkDataset)DataFrame[EntitiesPerChunkSchema]