When somebody clicks from another site to your site via a link, the browser usually sends you a referrer header that tells you that they came from an external source. You can see these in your log files. Most analytics software will give you are report on these as well.
Check out linkchecker—it will crawl the site (while obeying ) and generate a report. From there, you can script up a solution for creating the directory tree.robots.txt
An iframe sounds like it may be what you need, but I'm not 100% sure after reading your description. The iframe will show content from another URL in your page. Once you have that all you need to do is style the page and use CSS/JavaScript to add in your functionality as described.
Check out this: http://www.w3schools.com/tags/tag_iframe.asp
The quickest and easiest way is to use google developer tools in Google Chrome.
1st go to google developer tools. or F12 or Ctrl+Shift+Itop right Ellipsis->More Tools->Developer Tools
2nd Click on the "Network" tab.
3rd click on the "XHR" sub-tab. XHR(XMLHttpRequest)
if a site uses json it will be listed under the XHR sub-tab. You can search through the different return objects, select one and use the "preview" sub-sub-tab to view it.
View JSON
View JSON URL
Although the above way is the easiest it is not the most stable way of getting the info you need. Many sites make changes to the return data without any notice. This will probably break your app...
I think what you are looking for is an API(Application programming interface). Most web APIs return json or xml. You should start by searching for the api documentation for the specific site that you want to get json data from. Example documentation for sites that have public api feeds are github api or youtub api. Many of these will require authentication in order to get the desired json response but the documentation should show you how to do it.
Using a legitimate web api is the most stable way to go. Meaning your app has less chance of randomly breaking all of the time due to feed changes, url changes... I hope this helps!
Webmaster Tools has a feature that allows you to see which sites link to your site.
https://www.google.com/webmasters/tools/external-links
It should be a comprehensive list of all links that Google has discovered. You can also download the latest links it has found as well as other data.
I don't believe that Google provides a way to sort the inbound links for specific URLs, but you can download the csv files that Google provides and play around with it.
You can use iframe but, be sure that target website may disallow iframe embed. You can use following code;
var websites = new Array(
"http://www.hurriyet.com",
"http://www.milliyet.com",
"http://www.amazon.com"
);
var counter = 0;
var sTimeOut = setInterval(function () {
var index = counter%(websites.length);
$("#website_div").html($('<iframe src="' + websites[index] + '" width="500" height="500" border="0" scrolling="no"/>' ));
counter++;
}, 5000);
Put your websites in list, and it will rotated.
Here is a working demo: Demo