Exploring Google (Toolbox Activity 1)
Written by: Jonathan Briggs
September 29, 2009 [3465 views]
Deadline: Midnight, Monday 5th October 2009
In this first Toolbox activity you need to spend at least a couple of hours really trying to put Google through its paces.
Questions to think about
- What makes Google the most popular search engine in the world?
- How can you improve the results you get for a search?
- How can you use Google to get help when things go wrong?
- What else is Google doing besides search?
- How can you keep in touch with developments at Google?
- How does Google work and what is going on behind the scenes? (Part 2)
Read this first
You should always print out these instructions
Every activity is broken up into at least 2 parts. You need to make sure that you always complete part 1 and if you are trying to get a good result for the module you should also complete part 2.
Every activity comes with things to do and things to think about. At the end of each part of the activity there is a feedback form that allows you to prove that you have completed the work. The feedback form will not test every part of the activity and will also allow you to ask questions and report problems.
Group feedback will be provided if you complete the activity before the stated deadline.
You should always make sure that you make notes (on paper or in a word processor) during the activity so that it is easy to fill in the feedback form. This also ensures that if the feedback form does not work for some reasons you can go back and fill it in again.
The software used to power the form is called SurveyGizmo and is highly reliable.
Do remember that failure to take part in the activities will result in you having to take an examination (based on the same material).
Every activity will come with a list of useful links that should help you explore the questions asked. You may not need to use all of them but you may find them helpful for these activities and other parts of your course.
Do remember that these activities must be completed individually. This does not prevent group discussion or collaborative planning of the activities but each student must complete the feedback surveys on their own.
We reserve the right to interview students whose answers are identical to other students and if copying or plagarism is suspected both parties may be required to take the examination. A quality control check will be applied to all submissions and only those students who meet that threshold will have passed.
Before you start
- Download a copy of the Firefox broswer from http://en.www.mozilla.com/en/firefox/
We will use Firefox for many of the activities in this module because it works the same way across
different platforms and provides many useful extensions and tools. - Download and install the Google Toolbar http://www.google.com/tools/firefox/toolbar/FT3/intl/en/.
This is the first additional tool that you must install. Follow the instructions and restart Firefox to activate the toolbar.
Part 1: Going beyond “I feel lucky”
You are already familiar with Google but are you using all of its tools to get the best out of the results? Here are 4 things that you need to do:
1. How many results?
- Set your Google preferences so that you are getting 100 results per page
- Choose a popular pair of keywords such as “kanye west” and look at the number of results on www.google.com and www.google.co.uk. Make a note of the numbers. Why do
you think they are different - Look at the first 100 results - are these the best results possible?
- What about the order of these results. Make a note of the URL for the top natural result?
- Notice the advertising. Make a note of the company who is advertising along side these natural results.
- Page forward until you reach the 1000th result. What do you notice? Are you surprised? Check to see whether you get similar answers using Yahoo! and Bing.
2. Advanced searching
Google will let you refine your search either using the advanced search functionality, through using keywords in the query or by using tools on the results page.
- Work out how to find pages about search engine spiders that have been published in the uk in the last 6 months
3. Different versions of google
Take a look at scholar.google.com. This is an academic version of google that prioritises academic papers. Repeat some of the searches you have done so far.
4. Google labs
Take a look at Google labs http://www.googlelabs.com/and the range of new services that Google is developing. Take a look at Google blog search http://blogsearch.google.com/
Here are the specific questions you will have to answer in the feedback questionaire. For each answer you have to type a couple of sentences maximum.
Please Note: Writing "don't know", N/A, filling with rubbish or similar will prevent you from passing the quality control check.
Answers required
- How many search results did you find for “Kanye West” on the UK and US versions of Google?
- Why do you think these might be different? (Have a guess if you don’t know)
- Who is advertising along side results for “Kanye West”?
- What was special about the results beyond the first 1000?
- Why do you think that this is the case? Think carefully about technical, business and other issues that may be involved.
- How did you use Google to find articles about search engine spiders that had been published in the UK in the last 6 months? Describe what you did in full?
You will pass this activity if you give sensible (not necessarily correct answers) to the above questions.
Feedback your results to PART 1 using this survey
http://www.surveygizmo.com/s/176056/toolbox-1-1
Part 2: How does Google work?
Part 2 activities will generally be more open ended and require you to do some research on your own. In this activity we want you to find out as much as you can about how Google works.
Use Google to research and then write notes for each of the following:
- What does a search engine spider do?
- How can Google return its answers so quickly?
- What sorts of computers and software is Google using to provide its service?
- How does Google decide which pages to show at the top of its results?
Take this short quiz selecting the best answer for each question:
Answers required
Google
a. searches the web whenever a user presses the search button
b. searches an internal version of the web
c. searches an internal index of the web
Spiders
a. visit every page on the web
b. collect data from every page on the web
c. collect data from some pages on the web
Page Rank
a. is a measure of how relevant the contents of a page are to a search query
b. is a measure of how popular a page is with other web sites
c. is a measure of how much a web site has paid Google to be listed
Results
a. are shown according to relevance
b. are shown according to a measure of reputation
c. are shown according to relevance and reputation
Number of servers for Google
a. 1
b. 10 - 100
c. 100 - 1000
d. 1000+
Finally try to answer the following questions
- How would you explain to a none technical person how Google works?
- What would be required to build a search engine that is better than Google?
- What questions do you have about Google? (list at least 3)
You will pass this activity if you give sensible (not necessarily correct answers) to the above questions.
Feedback your results to PART 2 using this survey
http://www.surveygizmo.com/s/176070/toolbox-1-2
Useful links
Google Guide
Recommended Search Engines(University of Berkeley)
The Anatomy of a Large-Scale Hypertextual Web Search Engine
Google in Wikipedia
Google PageRank Basics (YouTube)
Sergey Brin and Larry Page: Inside the Google machine (Video)
Recent comments:
What do you think?
On September 29, 2009 at 7:43 PM, Rahim wrote:
How do we submit the Activities when we complete them
Jonathan replies: If you read the text above you will find two links to forms to submit your answers. They are marked FEEDBACK YOUR ANSWERS