.docx files contain a lot of data, but that data can be hard to extract and use when you want to perform specific operations on it, such as web scraping or database insertions.

You can use docx2txt to read the contents of a .docx file as plain text.
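Here is a minimal sketch of that step, assuming the package is installed with `pip install docx2txt`. `docx2txt.process` returns the text of the document as a single string. The helper names `text_to_lines` and `read_docx_lines` and the file name `sample.docx` are only illustrative and are not part of the library.

```python
def text_to_lines(text):
    """Split extracted text into a list of non-empty, stripped lines."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def read_docx_lines(path):
    """Read a .docx file and return its text as a list of lines."""
    # Third-party package; imported here so text_to_lines works without it.
    import docx2txt

    text = docx2txt.process(path)  # whole document as one string
    return text_to_lines(text)


# Usage (hypothetical file name, replace it with your own):
# lines = read_docx_lines("sample.docx")
```

Splitting on line breaks and dropping blank lines is just one reasonable way to get a list of strings. Depending on how your document is laid out, you may want to split differently.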

Now all the data is in the form of a list of strings, and you can save it to either a JSON or a CSV file.
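Saving the list could look roughly like this, using only the standard library `json` and `csv` modules. The function names, the output paths, and the `text` column header are assumptions made for the example.

```python
import csv
import json


def save_as_json(lines, path):
    """Write the list of strings to a JSON file as an array."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(lines, f, ensure_ascii=False, indent=2)


def save_as_csv(lines, path):
    """Write the list of strings to a CSV file, one string per row."""
    # newline="" lets the csv module control line endings itself.
    with open(path, "w", encoding="utf-8", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["text"])  # hypothetical header for the single column
        for line in lines:
            writer.writerow([line])


# Usage (hypothetical data and file names):
# save_as_json(["Name: A", "Age: 20"], "output.json")
# save_as_csv(["Name: A", "Age: 20"], "output.csv")
```

JSON keeps the list exactly as it is, while CSV is handier if you plan to open the result in a spreadsheet or load it into a database table.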

Only the code

Study more about regular expressions in Python here
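As a small taste of what regular expressions can do with the extracted lines, here is a sketch that pulls email addresses out of a list of strings with the standard `re` module. Email addresses are only an assumed example of a field you might want. The pattern is deliberately simple and will not cover every valid address, and `find_emails` is an illustrative name.

```python
import re

# Simple, intentionally loose pattern: name@domain.tld (one or more dot parts).
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def find_emails(lines):
    """Return every email-like substring found in a list of strings."""
    found = []
    for line in lines:
        found.extend(EMAIL_PATTERN.findall(line))
    return found


# Usage (hypothetical data):
# find_emails(["Contact: jane.doe@example.com", "no email here"])
```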

Chirag Taneja

Going through my college life, and I know studies are hard, lol. Big fan of people who write amazing answers on Stack Overflow.

