Assignment4

Assignment4 - Department of Computer Science The University...

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon
Department of Computer Science The University of Hong Kong CSIS1117A Computer Programming Assignment 4 Due Date: 23:59, Nov 15, 2009. You may assume all input are valid in these exercises. 1. Write a program to print all hyperlinks and their titles in an html file. A hyperlink is of the following format: <a href=” http://www.hku.hk ”>The University of Hong Kong</a> Note that any white space within the angular brackets (except those between quotes) is ignored, and there may be text other than the hyperlink. You can first read the entire file into a single string (reading line by line and concatenating them together). Then you can search for the substring "<a", which indicates the start of the hyperlink. Then, search for the next double quote. Extract the substring from this quote to the next quote. Then skip until ">". The title will then be the text until "</a>". Then find the next hyperlink by searching for the next "<a", until there is no more matching. You may assume the URL is always enclosed in a pair of double quote for simplicity.
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 2
This is the end of the preview. Sign up to access the rest of the document.

This document was uploaded on 05/04/2011.

Page1 / 2

Assignment4 - Department of Computer Science The University...

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online