Eggcorn Forum

Discussions about eggcorns and related topics

Announcement

Registrations were closed for a long time because of forum spam, but I have re-opened them on a trial basis.

The forum administrator (chris dot waigl at gmail dot com) reserves the right to ask users to demonstrate plausibly that they are real people with an interest in the topic of eggcorns. Users who cannot do so may be removed without further justification. Likewise, accounts that have not been used for posting may be removed.

Thanks for your understanding.

Chris -- 2015-05-30

#1 2009-10-22 01:04:26

JuanTwoThree
Eggcornista
From: Spain
Registered: 2009-08-15
Posts: 376

Using Google as a rough-and-ready corpus.

When you google an exact phrase you get a number of hits suggesting, sometimes, widespread use of a particular expression. Take for example “disillusion of parliament”, which gets a respectable 5,430 hits… until you look down and see that there are only two pages and 15 hits in total. A double modal (not an eggcorn, of course, but another interest of mine) like “might ought to” seems to get 104,000 hits, but the pages give up at 847, or 927 including duplicates.

Are the hits actually listed a fixed proportion of the estimated total? 15 of 5,000 (roughly 1 in 330) doesn’t look much like 900 of 100,000 (roughly 1 in 111).
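For anyone who wants to check the arithmetic, here is a rough sketch using the unrounded figures quoted above. The dictionary and the `one_in` helper are made up for illustration, and the numbers are simply what the search pages showed at the time of posting, so treat the output as a back-of-the-envelope comparison and nothing more.

```python
# Figures quoted in the post: (estimated hits, results actually listed).
searches = {
    "disillusion of parliament": (5430, 15),
    "might ought to": (104000, 927),  # 927 includes duplicates; 847 without
}


def one_in(estimated: int, listed: int) -> float:
    """Return N such that roughly 1 in N estimated hits is actually listed."""
    return estimated / listed


# Compute the "1 in N" figure for each phrase.
ratios = {phrase: one_in(est, shown) for phrase, (est, shown) in searches.items()}

for phrase, n in ratios.items():
    print(f"{phrase!r}: about 1 in {n:.0f}")
```

With the exact figures the two ratios still differ by a factor of about three, so the listed results do not look like a constant share of the estimate.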

So has “disillusion of parliament” been written down on web pages and the like 5,000 times or 15 times? If I’m missing something, can someone explain it to me using fairly short words?

On the plain in Spain where it mainly rains.

#2 2009-10-22 11:17:37

kem
Eggcornista
From: Victoria, BC
Registered: 2007-08-28
Posts: 2251

Re: Using Google as a rough-and-ready corpus.

Perhaps my post at the bottom of this thread will help you to understand the process.

I can’t think of any explanation for the hit estimate of “disillusion of parliament” being so high. In this case, Microsoft’s Bing search engine presents a more accurate estimate. Here is the Bing search. If you question Google’s estimate on the frequency of a given phrase, you should check with other search engines.

#3 2009-10-24 10:57:13

JuanTwoThree
Eggcornista
From: Spain
Registered: 2009-08-15
Posts: 376

Re: Using Google as a rough-and-ready corpus.

Thanks. You’ve been a great help.


On the plain in Spain where it mainly rains.

#4 2009-11-02 04:08:17

andrewmason
Member
Registered: 2009-11-02
Posts: 1

Re: Using Google as a rough-and-ready corpus.

Thanks for your help and guidance. This will help many others who are using this forum.

Last edited by andrewmason (2009-11-02 04:09:15)


Individual posters retain the copyright to their posts.