Analyze News and Pull Data from Texts






NLP projects for intermediates

Named Entity Extraction Project with spaCy



Among Practity’s Python data science projects and NLP projects for intermediates, this challenge focuses on Named Entity Recognition. Your task is to locate named entity mentions in unstructured text and classify them into predefined categories such as person names, organizations and locations.
You will be given a single text file containing multiple news articles. The goal of this Python project is to scan each article for the names of people and print the results according to several conditions covering sorting, whitespace, blank lines, etc.
Additionally, you are asked to implement functions to count, modify and add characters to the results.
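To give an idea of what the task involves, here is a minimal sketch of person-name extraction with spaCy. It is not the solution shipped with the project. The helper names (`extract_person_names`, `tidy_names`, `demo`), the sample sentence and the choice of the `en_core_web_sm` model are assumptions made for illustration, and the exact sorting and whitespace rules of the project may differ.

```python
import re


def extract_person_names(text, nlp):
    """Return the text of every entity that the pipeline labels PERSON."""
    doc = nlp(text)
    return [ent.text for ent in doc.ents if ent.label_ == "PERSON"]


def tidy_names(names):
    """Collapse inner whitespace, drop blanks and duplicates, sort A to Z."""
    cleaned = {re.sub(r"\s+", " ", name).strip() for name in names}
    cleaned.discard("")
    return sorted(cleaned)


def demo():
    """Hypothetical demo; requires spaCy and a downloaded English model."""
    import spacy  # imported here so the helpers above work without spaCy

    # Assumption: the small English model was installed beforehand with
    #   python -m spacy download en_core_web_sm
    nlp = spacy.load("en_core_web_sm")
    sample = "Angela Merkel met Emmanuel Macron in Paris on Tuesday."
    for name in tidy_names(extract_person_names(sample, nlp)):
        print(name)
```

Calling `demo()` prints one name per line. Which names are found depends on the model, so statistical NER may miss or mislabel some mentions.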


Robert Utterback
Robert is an Assistant Professor of Computer Science at Monmouth College (US).
He worked as a research intern and software engineer at Huawei and Cerner Corporation.
Robert holds a PhD in Computer Science (Washington University in St. Louis) and a Bachelor of Science in Mathematics and Computer Science.


After purchase, you will receive an email with a ZIP folder. It is also available in your Practity account.
The ZIP folder includes the following files:

  1. One PDF with the project description along with hints to guide you during the project.
  2. A “.py” file with the project solved. It contains not only the source code but also detailed explanations and comments about how the code works. For specific topics, links are provided to online tutorials.
  3. Two “.txt” files with the output of the code and the dataset with the news articles.


– Libraries: you will work with the following libraries: spaCy, re, datetime.
– Python functions, loops (for) and conditional statements (if/else).
– Reading CSV and TXT files (with open()), list comprehensions and dictionaries.
– Basic regular expressions.
– argparse when running from the Linux command line.
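As a rough illustration of how several of these skills combine, the sketch below reads a text file named on the command line, splits it into articles with a regular expression and counts repeated names with a dictionary. The function names, the `--sep` option and the assumption that articles are separated by blank lines are all hypothetical. The actual dataset layout is described in the project PDF and may differ.

```python
import argparse
import re
from collections import Counter


def build_parser():
    """Command-line interface: one positional path plus an optional separator."""
    parser = argparse.ArgumentParser(description="Scan news articles for names.")
    parser.add_argument("path", help="text file holding the articles")
    parser.add_argument(
        "--sep",
        default=r"\n\s*\n",
        help="regex separating articles (assumed here: one or more blank lines)",
    )
    return parser


def split_articles(raw_text, sep_pattern=r"\n\s*\n"):
    """Split the raw file content into a list of non-empty article strings."""
    parts = re.split(sep_pattern, raw_text.strip())
    return [part.strip() for part in parts if part.strip()]


def count_names(names):
    """Map each name to the number of times it appears in the list."""
    return dict(Counter(names))


def main(argv=None):
    """Parse arguments, read the file and report the size of each article."""
    args = build_parser().parse_args(argv)
    with open(args.path, encoding="utf-8") as handle:
        articles = split_articles(handle.read(), args.sep)
    for number, article in enumerate(articles, start=1):
        print(f"Article {number}: {len(article)} characters")
```

To run this as a script, call `main()` under an `if __name__ == "__main__":` guard and start it with a command such as `python3 script.py news.txt`, where both file names are placeholders.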

Python 3

If you need additional information, do not hesitate to contact us.


3 reviews for Analyze News and Pull Data from Texts

5.0 out of 5
  1. lou_bot (verified owner)

    I bought this project with the hopes that it would allow me to improve my NLP skills and knowledge. I’m happy to say that it ended up being exactly what I was looking for.

  2. Tim Dow (verified owner)

    A great project to practice and learn Natural Language Processing

  3. Dante (verified owner)

    If you have some basic experience with Python and really want to learn text processing, this practice is for you

