How to use LanguageDetector() from spacy_langdetect package?

Question

I'm trying to use the spacy_langdetect package, and the only example code I can find is the one at https://spacy.io/universe/project/spacy-langdetect. It throws this error: nlp.add_pipe now takes the string name of the registered component factory, not a callable component. So I tried passing the component's string name when adding it to my nlp pipeline, but this gives another error: Can't find factory for 'language_detector' for language English (en).

Accepted Answer

In spaCy v3.0, a component that is not built in, such as LanguageDetector, has to be wrapped in a factory function and registered under a name before you can add it to the nlp pipeline. In your example, you can do the following:

    import spacy
    from spacy.language import Language
    from spacy_langdetect import LanguageDetector

    # Factory function: spaCy calls it with the nlp object and the component name
    def get_lang_detector(nlp, name):
        return LanguageDetector()

    nlp = spacy.load("en_core_web_sm")
    # Register the factory under the string name that add_pipe will look up
    Language.factory("language_detector", func=get_lang_detector)
    nlp.add_pipe('language_detector', last=True)

    text = 'This is an english text.'
    doc = nlp(text)
    print(doc._.language)

For built-in components (i.e. tagger, parser, ner, etc.), see: https://spacy.io/usage/processing-pipelines
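The same registration can also be written with Language.factory used as a decorator, which some people find easier to read. Below is a minimal sketch of that form. To keep it runnable without spacy_langdetect or a downloaded model, it uses spacy.blank("en") and a made-up stand-in component: ToyLanguageDetector, the toy_language extension, and the factory name toy_language_detector are all hypothetical names for illustration and are not part of spaCy or spacy_langdetect.

```python
import spacy
from spacy.language import Language
from spacy.tokens import Doc


class ToyLanguageDetector:
    """Hypothetical stand-in for LanguageDetector (not the real detector)."""

    def __call__(self, doc):
        # Toy rule for illustration only: treat pure-ASCII text as English.
        doc._.toy_language = "en" if doc.text.isascii() else "unknown"
        return doc


# Register the custom attribute once; the guard makes re-running the cell safe.
if not Doc.has_extension("toy_language"):
    Doc.set_extension("toy_language", default=None)


# Decorator form of Language.factory: registers the function below
# under the string name that nlp.add_pipe() looks up.
@Language.factory("toy_language_detector")
def create_toy_language_detector(nlp, name):
    return ToyLanguageDetector()


nlp = spacy.blank("en")
nlp.add_pipe("toy_language_detector", last=True)

doc = nlp("This is an English text.")
print(doc._.toy_language)
```

With the real package, you would presumably return LanguageDetector() from the decorated function instead of the stand-in and read doc._.language, as in the answer above.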

Answer