LexRank approach for text summarization.
- sumy
- spacy
- neologdn
- en_core_web_sm: A spaCy English multi-task CNN trained on OntoNotes.
- punkt: NLTK sentence tokenizer
- Set up a Python 3.x virtual environment and activate it.
- Install the dependencies using `pip3 install -r requirements.txt`.
- Install a C++ compiler if `neologdn` is triggering `wheel` errors.
- Set up the models by running the following commands:
$ python -m spacy download en_core_web_sm
$ python -c "import nltk; nltk.download('punkt')"
- Run the `main.py` file.
- Enter the source path.
Results can be found here.
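
For orientation, here is a minimal sketch of the LexRank idea using only the Python standard library. It is not the code in `main.py` and not sumy's implementation; the function names (`split_sentences`, `tokenize`, `idf_cosine`, `lexrank_scores`, `summarize`), the regex-based sentence splitter, the `+1` idf smoothing, and the default `threshold` and `damping` values are all assumptions made for illustration. Sentences become graph nodes, an edge links two sentences whose idf-weighted cosine similarity exceeds a threshold, and a damped power iteration scores each sentence by its centrality.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase and keep alphanumeric runs; no stemming or stop-word removal."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def idf_cosine(tf_a, tf_b, idf):
    """idf-modified cosine similarity between two term-frequency Counters."""
    shared = set(tf_a) & set(tf_b)
    numerator = sum(tf_a[w] * tf_b[w] * idf[w] ** 2 for w in shared)
    norm_a = math.sqrt(sum((tf_a[w] * idf[w]) ** 2 for w in tf_a))
    norm_b = math.sqrt(sum((tf_b[w] * idf[w]) ** 2 for w in tf_b))
    return numerator / (norm_a * norm_b) if norm_a and norm_b else 0.0


def lexrank_scores(sentences, threshold=0.1, damping=0.85, iterations=50):
    """Return one centrality score per sentence (scores sum to 1)."""
    n = len(sentences)
    if n == 0:
        return []
    tfs = [Counter(tokenize(s)) for s in sentences]
    # Number of sentences each word appears in.
    doc_freq = Counter(w for tf in tfs for w in tf)
    # Smoothed idf (+1, an assumption) so ubiquitous words keep some weight.
    idf = {w: math.log(n / df) + 1.0 for w, df in doc_freq.items()}

    # Unweighted graph: connect sentences whose similarity exceeds the threshold.
    adj = [[1.0 if idf_cosine(tfs[i], tfs[j], idf) > threshold else 0.0
            for j in range(n)] for i in range(n)]
    # Row-normalise into transition probabilities; isolated rows become uniform.
    for row in adj:
        degree = sum(row)
        for j in range(n):
            row[j] = row[j] / degree if degree else 1.0 / n

    # Damped power iteration, as in PageRank.
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [(1.0 - damping) / n
                  + damping * sum(scores[i] * adj[i][j] for i in range(n))
                  for j in range(n)]
    return scores


def summarize(text, sentences_count=2):
    """Pick the top-scoring sentences and return them in document order."""
    sentences = split_sentences(text)
    scores = lexrank_scores(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(top[:sentences_count])]


text = ("Solar panels convert sunlight. "
        "Solar panels and wind turbines produce electricity. "
        "Wind turbines harvest breezes.")
print(summarize(text, sentences_count=1))
```

With this sample text the middle sentence is selected, because it shares vocabulary with both of the other sentences while they share none with each other. A real pipeline would swap the naive splitter and tokenizer for the punkt and spaCy components listed above.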