Link Analysis
Visualisation of links, by domain suffix, from the JISC UK Web Domain Dataset (1996-2010).
This visualisation shows an overview of how a subset of the sites in the JISC UK Web Domain Dataset (1996-2010) are interlinked. For each year, the corresponding chord diagram shows the percentage of links between the different second-level or top-level domains, such as the percentage of links found in *.ac.uk pages that link to *.co.uk pages.
About this visualisation
This dataset was generated from a subset of the JISC UK Web Domain Dataset (1996-2010), analysing the HTML pages and pulling out the 'href' attributes from every 'a' link. These were then aggregated by public suffix, e.g. all '*.ac.uk' counted as 'ac.uk', all '*.com' as 'com', etc. The source code used to do this is available here.
This summary data was then combined with the d3.js visualisation engine (specifically, this chord diagram) to produce the overview you can see here.
Note that the underlying data was generated using only one sixth of the total dataset, and due to the way the data was selected, the statistical siginficance of the results may be rather poor (expecially in the earliest or latest years). Further analysis will be required in order to confirm the overall trends.
If you have any questions, ideas or requests for alternative datasets, please get in touch.
Latest Instances
