How would I go about using C to write a program that takes a webpage given via the terminal in Linux, removes all the HTML tags and images, and puts the remaining content into a file?
I am just confused about how to get the webpage from the terminal.
Thanks for your help.
You can download the webpage with a program such as curl, wget, or aria2c, and then use your C program to strip the tags.
If you want to download the webpage from C itself, you can use libcurl. To get sample code showing how to use libcurl to download http://stackoverflow.com, use the following command. This will generate a file, downloadstackoverflow.c, which contains sample code.
This is a simple program that strips tags from HTML. It does not support tags in quotes ("). But you should get the idea.