Loading...
Projects

Natural Language Processing the Witcher Books 2022

This was a natural language project where I scraped data from the Witcher books and use Spacy to analyze the character relationships. Web scraping was performed on the Witcher Books Fandom page to extract all characters within the series. The book text was scraped from this repository. Spacy was used to analyze the book text and extract characters which would then be checked against the scraped character list from the web. A rolling window was used to analyze relationships, where a relationships was established whenever characters apaeared together within the window. When more than two characters appeared within the window, each character was assumed to have a relationship with all other characters in the window. Analysis was then performed and visualzations were created using NetworkX, Pyvis, and Pandas.

Networktop10
© 2024 Daniel Alas