Tag Archives: Google

Oi, Google! NO!!

I’ve been seeing what looks like a new annoying Google search “feature” for a few weeks. I have been trying to ignore it in the hope that it would go away but it hasn’t. The problem is that Google has started giving me long lists of YouTube videos for some of my queries, even though I am in web search. For example a search on comfrey compost tea came up with about a dozen videos before giving me web pages with text describing the benefits of comfrey compost, which was what I wanted. In addition, in the menus on the left hand side of the screen Google offered me options to refine my video search by duration. But, Dear Google, I did NOT want videos at all!

Google search for comfrey compost tea

It did not matter whether or not I was signed in to my Google account. The videos were still given priority. I wondered if this was just an issue with Chrome so I switched to Firefox. The list of videos disappeared and was replaced by just one entry for YouTube at the top.

Comfrey Compost Tea in Firefox and Incognito

This gave me a clue as to what might be going on. I use Chrome for most of my “personalised” search. I generally stay logged in to my account, have enabled web search history and do not clear out the search cookies. In contrast I use Firefox for “de-personalised” search. I stay logged out of Google and social networks, and cookies and history are cleared after each session. I usually watch permaculture and gardening videos in Chrome, which probably explains why YouTube was taking pride of place in many of my search results. To test the theory I paused and deleted my web search history, and cleared cookies and browsing data. I then signed out of Google, cleared cookies again and re-ran the search. The blasted videos were still there.

What if I ran the search in a Chrome incognito window? The results were identical to those when using Firefox. Back to a normal Chrome window and the videos returned. I then checked that my web history was off and deleted. It wasn’t and it steadfastly refused to go away. Then the penny dropped. All my Chrome data – bookmarks, history etc – are synced to my Google account so no matter how often I try and delete the stuff locally it will all come back down again from my account. I disconnected my Google account under Chrome’s settings and, “Hey presto”, no more videos. I reconnected and they were back. It appears that if you are using Chrome and have synced it with your Google account you will get personalised results, even if you are signed out of your account.

So, if you are a Chrome user you may think that you have switched off personalisation by logging out of your account but that may not be the case. If you are conducting serious research it is always worth running your searches in an Incognito window, using a different browser or a completely different search engine like DuckDuckGo (http://duckduckgo.com/).

Postscript: I forgot to mention that I also tried Verbatim, but to no avail. Verbatim makes sure that all your terms are in the pages/documents exactly as you have typed them in but that still gives Google plenty of leeway in presenting those results. Google still bombarded me videos although some were different from my original search.

Million Short: unearthing stuff hidden in the dungeons of Google’s results

Fed up with seeing the same results from Google again and again? Wondering if that elusive document is buried somewhere at the bottom of Google’s 2,000,000 hits? Then get thee hence to Million Short (http://millionshort.com/). Million Short runs your search and then removes the most popular web sites from the results. Originally it removed the top 1 million, as its name suggests, but the default has changed to the top 10,000. The principle remains the same, though: exclude the more popular sites and you could uncover a real gem. The page that best answers your question might not be well optimised for search engines or might cover a topic that is so “niche” that it never makes it into the top results. Million Short does not say what it uses for search results or how it determines what are the most popular web sites. According to Webmonkey “Sanjay Arora, founder of Exponential Labs, tells Webmonkey that Million Short is using “the Bing API… augmented with some of our own data” for search results. What constitutes a “top site” in Million Short is determined by Alexa and Million Short’s own crawl data.” (http://www.webmonkey.com/2012/05/million-short-a-search-engine-for-the-very-long-tail/).

Using Million Short is straightforward. Type in your search and select how many sites you want to exclude (top 10K, top million, top 100). The results page includes a list of the sites that have been removed and you can opt to add one or more back in. You can also block a site using a link next to it in the results or click on “Boost!” so that pages from the site go to the top.

Million Short results

Million Short automatically tries to detect which country you are in but you can change it under “Manage Settings and Country”. I didn’t notice much difference when I changed countries but then most of the queries I pass through Million Short tend to be scientific or technical. On the same page you can manage sites that you have blocked, added or boosted.

Does it work? I would not use it instead of the existing major search engines such as Google, Bing or DuckDuckGo but as an additional tool to surface material that is not easily found in the likes of Google. As well as web search there are image and news searches, but I’m not convinced that I’d find those all that useful.

If you are interested in comparing Million Short with Google try Million Short It On at http://www.millionshortiton.com/index.html. I had several goes at this and most of the results were a draw. That is no surprise as the searches I ran were very specific and I wanted to see if Million Short would pull up additional information, which it did. Million Short won outright on a couple and Google on one. The Google win was by default because Million Short did not come up with anything for comparison (the search in question was biofuels public transport carbon emissions).

There are a number of techniques that you can use to improve Google results for example changing the order of the words in your search, Verbatim, filetype or Reading Level but I would also recommend trying Million Short. The results should at least be different and may reveal vital information for your research.

But it’s all free on Google

This is the presentation I am giving tomorrow (27th September 2012) at the East of England Information Services Group of CILIP at the Bury St Edmunds Library. The event is “Information integrity on a decreasing budget”. Many of the slides are just images so I am not really giving that much away by releasing the slides early. And before anyone asks, no, I will be not providing notes and I will not be providing a version with a voice over – at least for free. Some will soon be made available in the resurrected Search Strategies section of my web site at http://www.rba.co.uk/search/index.shtml and in the subscription area only.

The straightforward, no notes, no voice over slides are available at

http://www.rba.co.uk/as/ – available for a few weeks only

http://www.authorstream.com/Presentation/karenblakeman-1549834-39-free-google/

http://www.slideshare.net/KarenBlakeman/but-its-all-free-on-google

Top search tips from North Wales

August is usually a quiet month for me with respect to work. Time for a holiday away and then a couple of weeks ambling along the Thames Path or pottering around the garden. This year, though, as soon as I was I back from my travels I was knuckling down and updating my notes for two search workshops in North Wales. Both were for the North Wales Library Partnership (NWLP), the first taking place at Coleg Menai in Bangor and the second at Deeside College. Both venues had excellent training facilities and IT, which meant we could concentrate on getting to grips with what Google is doing with search and experiment with different approaches to making Google do what we want it to do.

At the end of the workshops both groups were asked to come up with a list of  Top 10 Tips. I’ve combined the two lists and removed the duplicates to generate the list of 16 tips below.

  1. Repeat one or more of your search terms one or more times
    Fed up with seeing the same results for your search?  Repeat your main search term or terms to change the order of your results.
  2. Menus on left hand side of Google results pages
    Use the menus on the left hand side of the results page to focus your search and see extra search features. To see all of the options click on the ‘More’ and ‘More search tools’ links. The content of the menus changes with the type of search you are running, for example Image search has a colour option.
  3. Verbatim
    Google automatically looks for variations of your terms and no longer looks for all of your terms in a document. If you want Google to run your search exactly as you have typed it in, click on the ‘More search tools’ options at the bottom of the left hand menu on your results page and then on Verbatim at the bottom of the extended menu that appears.
  4. intext:
    Google’s automatic synonym search can be helpful in looking for alternative terms but if you want just one term to be included in your search exactly as you typed it in then prefix the word with intext:. For example carbon emissions buses intext:biofuels flintshire. The command sometimes has the effect of prioritizing pages where your term is the main focus of the article.
  5. Advanced search screen and search commands
    Use the options on the advanced search screen  or the search commands (for example filetype: and site:) in the standard search box to narrow down your search. A link to the advanced searchscreen can usually be found under the cog wheel in the  upper right hand area of the screen. If you can’t see a cog wheel or the link has disappeared from the menu go to http://www.google.co.uk/advanced_search. A list of the more useful Google commands is at http://www.rba.co.uk/search/SelectedGoogleCommands.shtml
  6. Try something different
    Get a fresh perspective by trying something different. Two most popular during these two workshops seemed to be DuckDuckGo (http://duckduckgo.com/) and Millionshort (http://millionshort.com). Other search engines to try include Bing (http://www.bing.com/) and Blekko (http://blekko.com/).
  7. Use the country versions of Google for information that is country specific
    This will ensure that the country’s local content will be given priority, although it might be in the local language. Useful for companies and people who are based in or especially active in a particular country, or to research holiday destinations. Use Google followed by the standard ISO two letter country code, for example http://www.google.de/ for Google Germany or http://www.google.no/ for Google Norway.
  8.  Filetype to search for document formats or types of information
    For example PowerPoint for experts or presentations, spreadsheets for data and statistics, or PDF for research papers and industry/government reports. Note that filetype:ppt will not pick up the newer .pptx so you will need to include both in your search, for example filetype:ppt OR filetype:pptx. You will also need to look for .xlsx if you are searching for Excel spreadsheets and .docx for Word documents. The Advanced Search screen file type box does not search for the newer Microsoft Office extensions.
  9. Clear cookies
    Even if you are logged out of your Google account when you search, information on your activity is stored in cookies on your computer. These can personalise your results according to your past search and browsing history. Many organisations have set up their IT systems so that these tracking cookies are automatically deleted at least once a day or whenever a person logs in or out of their computer account. At home, your anti-virus/firewall software may perform the same function. If you want to make sure that cookies are deleted or want to control them manually How to delete cookies at http://aboutcookies.org/Default.aspx?page=2 has instructions on how to do this for most browsers.
  10. Looking for research papers? Google Scholar (http://scholar.google.com/) is one place to look but there may be additional material hidden somewhere on an academic institution’s web site. Include advanced search commands, for example filetype:pdf site:ac.uk, in your search.
  11. For the latest news, comments and analysis on what is happening in an industry or research area carry out a  Google blog search and limit your search by date. Simply run your search as usual in the standard Google search box. On the results page click on Blogs in the menu on the left hand side of the screen and then select the appropriate time option.
  12. site: and -site:
    Use the site:command to search within a single site or type of site.For example:2011 carbon emissions public transport site:statistics.gov.uk to search just the UK official statistics web siteasthma prevalence wales site:gov.uk OR site:nhs.ukto search all UK government and NHS web sites

    If you are fed up with a site dominating your results use -site: to exclude it from your search.

    For example:

    Dylan Thomas -site:bbc.co.uk

  13. Reading level – from tourism to research
    Use this to option in the menus on the left had side of your results page to change the type of information. For example run a search on copper mines north wales. Then click on Reading Level in the left hand menus. Selecting “Basic” from the options that appear at the top of the results gives you pages on tourism and holiday attractions. “Advanced” gives you research papers, journal articles and mineral databases. Google does not give much away as to how it calculates the reading level and it has nothing to do with the reading age that publishers assign to books. It could involve sentence structure, grammar, the length of sentences on a web page, the length of the document, the terminology used and doubtless many other criteria.
  14. Google.com
    Apart from presenting your search results in a different order Google.com is where Google tries out new features. As well as seeing pages that may not be highly ranked in Google.co.uk you will get an idea of how Google search may look in the UK version in the future.
  15. Numeric range search
    Use this for anything to do with numbers – years, temperatures, weights, distances, prices etc. Use the boxes on the Advanced Search screen or just type in your two numbers separated by two full stops as part of your search.For example:world oil demand forecasts 2015..2030
  16. An understanding of copyright is important if you intend to re-use information found in the web and absolutely essential if you are going to use images. Creative Commons licences clearly state what you can and can’t do with an image but they are not all the same. The list at Creative Commons http://creativecommons.org/licenses/ outlines the terms and conditions. “FAQs – Copyright – University of Reading” at http://www.reading.ac.uk/internal/imps/Copyright/imps_copyrightfaqs.aspx gives some guidance on copyright but if in doubt always ask! An example of what can happen if you get it wrong is demonstrated by “Bloggers Beware: You CAN Get Sued For Using Pics on Your Blog” http://www.roniloren.com/blog/2012/7/20/bloggers-beware-you-can-get-sued-for-using-pics-on-your-blog.html.

 

Tales from the Terminal Room issue 102 available

Tales from the Terminal Room, issue number 102 now available at http://www.rba.co.uk/tfttr/archives/2012/july2012.shtml

Most of the information in this newsletter will already have appeared in this  blog, the exception being the Twitter Notes.

In this month’s issue:

  • Search tools
    • Yet another irritating Google feature
    • Google maps UK canals
    • Google adds cycling routes to UK maps
  • Beating Google into submission – top tips
  • Twitter Notes

Yet another irritating Google feature

There was a time when Google would aggregate pages from the same website in your search results. There might be just a couple of entries for the site with a “More from….” link next to the result.

Google More From

Alternatively you might see a mini sitemap:

Google result for UKeiG

This has the advantage that you are not swamped with results from a single website but are given instead a variety of options that might provide you with a better answer to your question.

Not any more.

You may have noticed that multiple entries from single websites have started appearing in your results. For example, rather than just one Wikipedia entry you see 4, 5, 6 or even more. On the other hand, you might not have noticed anything at all. Some of my colleagues are seeing this and some are not. Google tests new features and algorithms on a small percentage of its users to see how they react so new or test features are not seen by everyone (see How Google makes improvements to its search algorithm – YouTube http://www.youtube.com/watch?v=J5RZOU6vK4Q). As far as I’m concerned this particular “improvement” is a disaster.

I was running a very general search on the use of biofuels by public transport in the UK. I just want to get an idea of some of the issues that were being discussed before refining my search and went, by default, to Google. My first screen had nothing but results from the UK government Department for Transport (DfT).

Google search DFT

I scrolled down and saw more DfT pages. I scrolled down further and yet MORE dft pages. OK, Google, so dft.gov.uk is a good place for me to look at biofuels in public transport. I get the message. STOP! There were 27 DfT pages in total flooding the top of my results page, which I have set to display 100 entries at a time. Creeping in at number 28 came the Guardian with 5 results.

Google biofuels search Guardian

The Friends of the Earth website had 7 results, and then at last I started to see more variety in my results at around number 40, but still with a lot of repetition.

 

Google Biofuels

Google may think that the DfT is a very important source of information on the topic but I want to decide whether or not to explore more of a particular site. Spamming my results list annoys me and makes me want to go elsewhere. So I did.

DuckDuckGo (http://www.duckduckgo.com/) is my main Google alternative and it came up with a decent and varied set of results without repetition, hesitation or deviation.

Duck Duck Go biofuels search

Bing (http://www.bing.com/) and Yandex (http://www.yandex.com/) came up with similar, non-repetitive results.

Blekko (http://www.blekko.com/) came up with some interesting alternative pages for me to consider. These would not have been that useful to me in the earlier stages of my research but this test confirmed my feeling that Blekko is good at pulling up information that explores more than the mainstream issues.

Blekko results for test search

 

If you want to stay with Google how do you deal with multiple listings of sites? The most obvious approach would be to incorporate a ‘-site:’ command in your search, for example:

biofuels public transport -site:dft.gov.uk

If you are conducting in depth research and are likely to be running many variations on a search, incorporating ‘-site:’ each time can become a chore. Google’s own browser Chrome has a Personal Blocklist extension that enables you to block selected sites from results (https://chrome.google.com/webstore/detail/nolijncfnkgaikbjbdaogikpmpbdcdef). Once installed a block link appears next to each entry in your results. Click on the link to block the site from all future results. A message appears at the bottom of searches that would normally contain pages from the blocked site warning you about exclusions.

Blocklist message

The ‘show’ link displays and highlights the previously blocked pages and offers an option to unblock them.

Unblock option

Neither the -site: option nor the Blocklist approach should be necessary. There was nothing wrong with the previous ways of offering additional pages from a site in search results. It wasn’t broke but Google did break it by trying to fix it. For me, there are now several Google alternatives that produce quality results and with less irritation. I shall be using them more in future.

Google maps UK canals

First cycle routes and now canals. Google is collaborating with the Canal and River Trust to provide a Google Map guide to the UK’s canal network called In Your Area (http://canalrivertrust.org.uk/in-your-area). It is not available as part of the standard Google Maps. The map allows you to enter your address or postcode to find the nearest canal. The map shows the locations of canals, canal locks and bridges and also volunteering opportunities, places to eat and drink and boating services and moorings.

Canal and River Trust Google Map of waterways

It is early days and not everything is marked up on the map, or at least it isn’t for the Kennet and Avon Canal in Reading. Also planned for later this year is the addition of  ‘Street View’ images of the canal and river network. (Please, no lurking in the bushes by the side of the tow paths and pushing the Google cycles into the canal!)

Google adds cycling routes to UK maps

Google has added cycle routes and directions to its UK maps. The feature has been available on US and Canadian maps since 2010 but has now been extended to the Europe and Australia. In the UK Google has been working with Sustrans (http://www.sustrans.org.uk/) to include bike trails, lanes and recommended roads. Set your starting point and destination as usual and the directions area on the screen should include a bicycle icon in addition to the car, public transport and walking icons.

Google Maps cycle option

 

Select a suggested route and as well as text instructions it will be outlined in blue on the map. The “bicycling layer” also shows trails (dark green lines), dedicated lanes (light green lines) and bicycle friendly roads (dotted green lines). Google came up with two routes from my house to Reading Railway Station. The first more direct one followed the roads.

Google Maps cycle route

 

The second suggestion took the scenic route along the river, which would be more pleasant and probably safer during the rush hour.

Google Maps cycle route

 

The directions come with the usual warning that they are in beta and that you should use caution. There is an option to report unmapped bike routes, streets that aren’t suited for cycling, and other problems.

Further information is available on Google Lat Long: Biking directions expands into Europe and Australia (http://google-latlong.blogspot.co.uk/2012/07/biking-directions-expands-into-europe.html. The Guardian Bike Blog has tested out a couple of routes in London (Google Maps’ cycle routes: just how good are they?  http://www.guardian.co.uk/environment/bike-blog/2012/jul/12/google-maps-uk-cycle-routes?) and set up a Twitter hashtag #cycletest for cyclists to comment on the routes they have tried.

Beating Google into submission – top tips

Oh what fun we had beating Google into submission on June 28th! This advanced Google workshop was held at Reading University and covered some of the new goings on at Google (no more ‘ANDing’ for search terms, personalisation of results) as well as the some of the established techniques for making Google behave itself. The following is what the group came up with at the end of the day as their collaborative top tips for persuading Google to run your search the way you want it run.

1. Search settings

Use the search settings to:

  • alter the level of the safe search
  • switch on/off Google Instant. This is the feature that changes and displays results as your type in your search. Some people find it useful whilst others find it extremely irritating. A big disadvantage of it is that it only displays 10 results per page.
  • increase the number of results per page from 10 to up to 100. Since you can no longer guarantee that you will find the most relevant page in the first 10 results this enables you to view more without having to click through to the next page. Google Instant, which allows only 10 results per page, must be switched off.
  • open results in a new browser window or tab. This allows you to view results while leaving your search page intact in a separate tab or window.
  • switch on/off or edit your search history. Search history is used by Google to customise your results and some people prefer to switch it off. It can, though, provide a useful record of the searches you have carried out and the pages that you have visited from that search.

Search settings is hidden under the cog wheel which appears in the upper right hand area of your results page. In some browser versions it appears in the top right hand corner of the Google home page. The Advanced Search screen link is also hidden under the cog wheel.

Search Settings

2. Sign out of all social media and search engine accounts

Both Google and Bing are experimenting with including content from your social media connections in your search results when you are signed in to your networks. For Google, the Google.com version is where it is happening at the moment and it pulls in content from members of your Google+ circles. to see Bing’s new social sidebar, which includes content from Facebook friends and Twitter,  you have to use the US version of Bing.  The Google+ results are intermingled with the main results whereas Bing displays then in a separate sidebar on the right of the results page. For more details see Danny Sullivan’s article Bing Relaunches, Features New Social Sidebar http://searchengineland.com/the-new-bing-microsoft-tries-again-with-search-meets-social-120728

Including posts from your social network friends in your results is not always a bad thing. You may uncover valuable information and gain a different perspective on the subject of your research. There is, though, the issue of privacy. A contact in one of your Google+ circles may have posted a comment and restricted it to a circle of which you happen to be a member, so it is not public. If you want to include the information in a print out or report for a client you will have to seek permission first. Even if you try and anonymise the information there may still be enough clues to identify the source.

3. Check out Google.com as well as Google.co.uk

Apart from presenting your search results in a different order Google.com is where Google tries out new features. As well as seeing pages that may not be highly ranked in Google.co.uk you will get an idea of the future direction of Google search.

4. Limit by date.

Use the date options in the menu on the left hand side of the results page to limit your results to the last day, week, month, year or within a custom date range. This tends to work best with blogs and news sources. With ordinary web pages Google looks at the time stamp that is assigned to a page when it is uploaded, or reloaded, to the web site. This can be very different from the date on which  it was written. If you are looking for recent material it will, though, exclude pages that have been languishing untouched on a web site for years. To see the date option you have to click on the ‘More search tools’ options at the bottom of the menu.

5. Verbatim.

The essential tool for taming Google. Google automatically looks for variations of your terms, which is not always helpful. Prefixing a term with the ‘+’ sign to force an exact match no longer works in web search, but confusingly still does  in Google Scholar, and Google has suggested using double quote marks around terms or phrases instead. This does not always work.  And now Google no longer looks for all of your terms in a document. If you want Google to run your search exactly as you have typed it in, click on the ‘More search tools’ options at the bottom of the left hand menu on your results page and then click on Verbatim at the bottom. Unfortunately, Verbatim does not work with the date options but there is a solution….see number 6 below.

6. Combining with Verbatim with date limits.

There are two ways to do this: the hard way and the easy way.

First of all the hard way. This uses the ‘daterange:’ operator and Julian dates. Daterange does not understand the mm/dd/yyyy or dd/mm/yyyy date formats. You have to convert your dates to Julian date format. This is explained on the Julian Date Converter page at http://aa.usno.navy.mil/data/docs/JulianDate.php and there is a handy tool that will do the date conversion for you. You then copy the Julian date omitting the fraction and paste it into your search. For example if you are looking for  pages mentioning housing benefits Cameron between June 20th and June 26th 2012:

daterange:2456098-2456104 housing benefits Cameron

Once the results are displayed use the Verbatim option to force Google to look for exactly what you’ve asked for.

Now the easy way. There are several tools that will carry out a date limited Google search and one of them can be found at http://gmacker.com/web/content/gDateRange/gdr.htm (many thanks to Richard Clauson who found this for us). Simply fill in the boxes and on the Google results page click on Verbatim at the bottom of the left hand menu.

Why have I detailed the hard way? Because the easy tools may stop working or disappear without trace.

7. Results page sidebar.

Use the sidebar on the left hand side of the results page to focus your search and extra search features. To see all of the options click on the ‘More’ and ‘More search tools’ links. The content of the sidebar changes with the type of search you are running, for example Image search has a colour option.

8. Google Art Project http://www.googleartproject.com/

This is a collaboration between Google and over 150 galleries from across the world.  You can take a virtual tour of a gallery and zoom in on a painting to see the brushstrokes. You can view paintings and drawings by gallery or by artist. Warning: this is highly addictive!

9. Numeric range.

Use this for anything to do with numbers – years, temperatures, weights, distances, prices etc. Use the boxes on the Advanced Search screen or just type in your two numbers separated by two full stops as part of your search. For example:

world oil demand forecasts 2015..2030

10.  Repeat your search terms one or more times. 

Ideal for getting out of a search rut or forcing Google to give you different results. Repeat your main search term or terms to change the order of your results – sometimes radically.

Workshop: Beating Google into submission (Reading)

A reminder that I am running an advanced search workshop concentrating on Google on June 28th, venue Reading University.

Topics will include:

– how Google works – what Google tells us and what we have to guess
– latest developments at Google- what’s gone, what’s new, impact on search results
– how Google customises your results and can you stop it?
– how to focus your search and control Google
– Google’s specialist tools and databases
– what Google is good at and when you should consider alternatives

You will have ample opportunity to experiment and try out the techniques for yourself. Exercises are provided to help you test out the search features but you are free to explore and try out searches of your own.

This workshop is suitable for all levels of experience. The techniques and approaches covered can be applied to all subject areas.

Cost: £150 +VAT (Total £180). A limited number of places for unwaged and students are available; please contact karen.blakeman@rba.co.uk for further details.

Full details are at Beating Google into submission: how to get better search results http://www.rba.co.uk/training/Google.htm