Monday, January 11, 2010

HOW GOOGLE WORKS

How Google Works

Google runs on a distributed network of thousands of computers.. Parallel processing is a method of computation in which many calculations can be performed simultaneously, significantly speeding up data processing. Google has three distinct parts:

• Googlebot, a web crawler that finds and fetches web pages.
• The indexer that sorts every word on every page and stores the resulting index of words in a huge database.
• The query processor, which compares your search query to the index and recommends the documents that it considers most relevant.

1. Googlebot
Googlebot is Google’s web crawling robot, which finds and retrieves pages on the web and hands them off to the Google indexer.
Googlebot consists of many computers requesting and fetching pages much more quickly than we can with our web browser. Googlebot can request thousands of different pages simultaneously. Googlebot finds pages in two ways: through an add URL form, www.google.com/addurl.html, and through finding links by crawling the web.
To keep the index current, Google continuously recrawls popular frequently changing web pages at a rate roughly proportional to how often the pages change. Such crawls keep an index current and are known as fresh crawls. Newspaper pages are downloaded daily, pages with stock quotes are downloaded much more frequently. Of course, fresh crawls return fewer pages than the deep crawl. The combination of the two types of crawls allows Google to both make efficient use of its resources and keep its index reasonably current.

2. Google’s Indexer
Googlebot gives the indexer the full text of the pages it finds. These pages are stored in Google’s index database. This index is sorted alphabetically by search term, with each index entry storing a list of documents in which the term appears and the location within the text where it occurs. This data structure allows rapid access to documents that contain user query terms.

3. Google’s Query Processor
The query processor has several parts, including the user interface (search box), the “engine” that evaluates queries and matches them to relevant documents, and the results formatter.
PageRank is Google’s system for ranking web pages.Google considers over a hundred factors in computing a PageRank and determining which documents are most relevant to a query, including the popularity of the page, the position and size of the search terms within the page.
Google also use machine-learning techniques to improve its performance automatically by learning relationships and associations within the stored data. For example, the spelling-correcting system uses such techniques to figure out likely alternative spellings.
Let’s see how Google processes a query.





This article is taken from http://www.googleguide.com
For more information on how Google works, take a look at the following articles.
Google’s page on Google’s Technology, www.google.com/technology/.
• How does Google collect and rank results?, www.google.com/newsletter/librarian/librarian_2005_12/article1.html.
• Google’s PageRank Algorithm and How it Works, www.iprcom.com/papers/pagerank/
• Google’s PageRank Explained and How to Make the Most of It, www.webworkshop.net/pagerank.html

Monday, January 4, 2010

Google Marketing Mix

GOOGLE 5P's


Google is a search engine. Search engines are used to search the Internet. However Google is much more than a search engine - it's a global company that specializes in innovation and technology. The business focuses on information made up mainly from web pages, although today all information is absorbed by the Google sponge including books, videos and music.
Google's search engine indexes billions of pages and gives the search speedy results. The engine ranks websites organically regarding links into a page as a positive endorsement or vote. So if people like your pages they will link to them and the page will get a better rank than sites with fewer in-links.
Google was started in 1998 by Larry Page and Sergey Brin with an initial investment of $100,000. The company went public in 2004 and both founders did very nicely thank you (and became billionaires overnight). At that time the duo employed around 7000 people and grew at a tremendous rate, with some claiming that Google was the fastest growing internet company in the world. In 2008 revenues were more than $21 billion and net profit was $4 billion.

Product

• Google's income is made through advertising. When a consumer types in a keyword such as 'computer' the search engine will display the results - as it would for any search term. However you will notice that at the top and/or along the right hand side of the results, there are a series of advertisements. These advertisements are paid for by companies. The advertising program is called Google AdWords.
• Google has a relationship with a number of libraries around the world. One of its goals is to digitize as many books as possible and to include them in search result. This could mean that all books are available to everybody. The key problem with this initiative (apart from the enormity of the task) is that Google does not own the rights to all books - the writers and the copyright owners do, and they are not happy.
• Google is the world's most popular search engine.
• Google Earth enables users to view the world from space. That's a real opportunity for you and me to experience something that our ancestors never did. However there could be security implications. Like any information - it can be used for good or bad. Anyway you'll notice that the pictures are often dated and taken some time ago. Privacy is also an issue - do you want a satellite taking pictures of your home for the world to see? In 2009 they launched a revised version of Google Earth which includes the opportunity to view 3D oceans.
• Critics argue that Google is a tool for plagiarism(illegal use) . Plagiarism is essentially cheating by passing off the work of others as your own when submitting assessments at school, college and university. It is the same as copying and is often punished.
• Google Scholar - which supports a broad trawl of material such as peer reviewed journals, theses and other academic material.
• iGoogle - a personalized Google page.
• The ever evolving list of products includes Google finance, Google news, Google blog search, Google video, YouTube, Google sites, Blogger, Orkut, Google Reader, Google Groups, Google Calendar and Google Docs.
• In 2008 Google Chrome was launched. Google Chrome is an open source browser.

Price

• How does Google make money? Through a special advertising program called AdWords. AdWords are keyword-based advertisements that are bought by companies. So if you have a company that distributes computer, you would bid against other distributors of computer for the highest place. By bidding for lucrative keywords this raises the price and Google make money. It's rather like selling a rare item on eBay; the rarer it is the more money you make; the more bidders that compete for the item the more money you make. Hence the more valuable a keyword the more it will make. Advertisers are making more than their investment in advertising, and this makes it an appealing program for business. It is measurable using basic software so advertisers can work out how much they are making on their investment, which is more complex to do with traditional advertising media.

Place

• The company is located at Mountain View in California. The site looks very much like a university campus with gyms and cafes. The environment enables employees to maximize their time. The Googleplex is the name given to its HQ.
• Another way of looking at place is that Google is an online business. it distributes using an the internet as its channel.

Promotion

• Google uses AdWords itself. Often you'll see adverts with a link to Google's own services.
• They include flyers inside business magazines.
• Google Chrome has its own TV advert.
• Google has a Public Relations function that it uses to proactively manage media.
• Google will sponsor a $30 million competition for an unmanned lunar landing. The winner must land a rover on the moon; the rover should travel 500 metres, and then send back a video to Earth.

Process

• Google retains your search term. It collects data on searches to help to refine the search algorithm. So don't think that you search anonymously. Google keeps your search terms and can link them to the address of your computer, and then to you. Whilst Google may not wish to spy on you, governments may take an interest in searching habits and this is a civil liberties issue.
• If you use Google mail (Gmail) or Google calendar then you are giving even more information about yourself to Google.
• Google co-operates with the Chinese government in its censorship of certain search terms and results. Is it becoming a political animal, or just maximizing a business opportunity?

Physical Evidence

• The name Google means a number followed by 100 zeros. However the founders mistakenly registered Google as their domain name.
• The company is located at Mountain View in California (see 'place').

People

• In 2008 Google employed 20,000 people.
• Many of the original employees of Google came from Stanford and other elite US universities. It employs the top brains, and people like working together Google's innovative business culture. Employees are encouraged to take advantage of 20% time - that's one day every week working on their personal pet project. They play sports at lunchtime, with Larry and Sergey enjoying roller hockey in the early days.