Google helping to find out duplicate data

Posts Tagged ‘data’

Google helping to find out duplicate data

Monday, November 24th, 2008

Duplicate content in various web pages provide a serious threat to search engines. Search engines will normally have to face a serious challenging question on which pages have to be shown as search results to users. They don’t want to show more than one web pages with same content to the users. 

It is here that a new patent from Google has come to help content creators. This will help creators to identify if their content is already present in the internet in some other web pages. This patent is being designed with the help of contents being stored in specific system. When user submits content, Google will look for similar and matching contents. If any such are present in their store, this will be intimated to the users. 

The patent by Google will provide details to users on where to search for duplicate content similar to their content. This will help users to check if their content is being used in some other places without authorization. Though there are services like copyscape, this will not consider video and audio images when checking for duplicate data. It is here that this patent by the search engine giant will gain high importance. This will also help to find out duplicate data that are being copied from offline sources as well.

If you enjoyed this post, make sure you subscribe to my RSS feed!

Google to help CDC

Sunday, November 16th, 2008

Google has mentioned that with the support and help of user queries connected with flu symptoms, they will be able to predict the trends, ascending or descending, of the diseases, throughout USA. The wing of Google developed for this will give users a daily update of the trends of flu, through daily estimates of the cases.

GFT will be collecting the details directly from the users. The main intension of creating GFT was that, sick people will be depending on internet as a means of knowing more about the disease, as compared to normal people. Hence any outbreak will be immediately reported by GFT, thereby helping CDC to record and process data further. This will be really helpful for hospitals to prepare in such cases of a large influx of patients.

When this service could be extended to other diseases also, this could largely benefit as an advanced tool in knowing the real trend and statistics collected from real time environment. The technology used by Google for this could easily be extended to other medical conditions, thereby making it able to prevent the spreading of various other diseases. Most of the diseases will hence be predicted by this method in the first place, since tracking using this technology and methods are more effective and useful.

If you enjoyed this post, make sure you subscribe to my RSS feed!

Universal search patent for Google

Monday, November 10th, 2008

Google has always been trying to get something different that makes browsing a different experience with their engine. A recent update from their side is the patent the company had obtained on universal search.

As per the new patent, visitors will be able to get search results coming from different categories. This includes various areas such as news, images, ads and web pages, when they type the word for searching. 

There are various advantages of this universal search concept. The most important one is the improved search experience felt by the searchers. They need not have to worry about obtaining results from various categories. 

Before the launch of this patent, users continued to use the default search results, without using the category tab provided by the search engine. This has always limited the chances provided by the company for the users. 

In this new concept, Google maintains different databases for various categories. When a query word comes, the engine searches for relevant information in each of the databases. This will be ranked. This rank is then used for comparison to know the relevance of the information. Finally the data will be shown back to the users. Through such an advanced indexing, users will be able to get details from the most relevant categories, rather than looking for this in the results displayed

If you enjoyed this post, make sure you subscribe to my RSS feed!

Google analytics

Wednesday, October 22nd, 2008

After their attempt to a little touch up, Google analytics offers an important upgrade. They are offering a better approach to:
• AdSense integration
• Motion Charts
• advanced segmentation
• API
• custom reports
• updated user interface

Though being hailed as an upgrade, the great news is it’s free. These tools can be used by all Google analytics users. The idea behind this move is to make the expensive and difficult to use tools gain larger acceptance. In this new approach, they are free and easy to use. This adsense integration lets you link your adsense and analytics account after which you can view all adsense data. Now you can evaluate and watch your account closely and determine changes in:

• total revenue
• impressions
• clicks
• click-through ratio
• revenue per day,
• revenue per hour
• revenue per page
• per pages profitability
• revenue per referral
• data on sites that bring in profitable traffic

If you’re wondering how to get around to using these tools which may seem a little complicated, there’s good news. Google has made available videos to aid users in their quest to master the use of their integration tools. Things will take a while to be used by all but the beginning is here.

If you enjoyed this post, make sure you subscribe to my RSS feed!

Cloud storage for webmasters

Tuesday, September 30th, 2008

The introduction of new cloud storage from Parascale is latest news in the webmaster field. The data as per this is not kept on the physical devices; instead internet is used to store the details.  

Company has already announced the storage of data on a high security system. The central control servers with metadata only have access to the network nodes. There is no human interference in the network in a virtual mode. Operations can be performed without the need for a peer confirmation.  

Cloud can be upgraded according to the requirements of the clients. The software from parascale can be used to bring together the servers from various vendors together.  

Only a limited number of clients are allowed access for this storage. The data can be loaded on Linux as well as Windows server, including old servers as well.   

The cost of having this cloud storage is very less and there is no additional cost for the bandwidth provided for customers. This will make it convenient for the clients as compared to their competitors using other modes of storage facilities. Service providers can go for this storage without having to go for technology changes to have this storage facility with them.  

If you enjoyed this post, make sure you subscribe to my RSS feed!

Google going to reduce IP retention time

Monday, September 22nd, 2008

Google in a recent blog from their office has declared that, they are going to anonymize the IP of the logs in their search engine servers in 9 months. This has been an update on the earlier declaration of the retention period of 18 months.

This change has been made by the company in an attempt to protect the data and privacy of the users. The search engine giant has been closely researching this and has even accepted a request to keep a link for privacy policy on their home pages.

The main concern of the Google engineers have been for how to find out a way in which they can reduce the loss of data quality in the logs and at the same time, bring down the retention period of the IPs. This data is being used by the company to improve the search results and conduct the advertisement campaigns based on the location details as well as language preferences of their users.

Even when the methods for anonymization will have to face changes, it will be successful in bringing down the retention period. So, they have been able to devise methods that preserve the data utility and anonymize the IP address as well.

If you enjoyed this post, make sure you subscribe to my RSS feed!