Welcome our webmaster and SEO forum
Please enjoy the forum, contribute what you can, and wind up the Moderators!
Results 1 to 4 of 4

Thread: How Much Page Does Yahoo Search Index?

  1. #1
    ovi Guest

    Default How Much Page Does Yahoo Search Index?

    One of the lesser-discussed facets of Web searching is the spidering limits of search engines. Even if a search engine is a full-text engine, it may not search the entirety of a given page if it’s too large. In Google’s case the limit is 101K for HTML pages (its spider will only index the first 101K of an HTML Web page; search Google for aardvark apple zither zephyr filetype:html and look at the file sizes of the results) and ? for PDF pages. (I can’t see the limit; if you look at tinyurl.com/4px8n ; you’ll see that about two-thirds of the pages listed in the TOC are available in Google’s HTML version. 300K limit? 500K?)

    I knew that Yahoo had a larger index limit, but I didn’t know how large. I learned earlier this week that Yahoo’s limit is the first 150K of a Web page, while its PDF indexing limit is 500K.

    … this is what I’m told, anyway. However, I’m finding something interesting. If you search Yahoo for aardvark apple zither zephyr originurlextension:html (originurlextension: is Yahoo’s gawdawful syntax for filetype:; I’m told they’ll be fixing it soon. Propburgers to Greg Notess of searchengineshowdown.com for educating me about it) you’ll find that filesizes are listed with search results, and the filesizes listed are well over 150K – I see page sizes of over 800K listed here! At least one of the pages listed, at 173K, appears from its cache to be fully indexed (the headers, footers, and copyright disclaimers are all in place – it doesn’t look “cut off") and a cache copied-and-pasted into a text editor weighs in at well over 200K.

    The bottom line is that Yahoo indexes far more of HTML pages than Google; if you’re running searches which might tend to focus on large pages (like word listing searches that might point you to dictionaries) try Yahoo first.

    I take this article from searchenginejournal.com

  2. #2
    jhonmark555 is offline Senior Member
    Join Date
    Mar 2011
    Posts
    109

  3. #3
    blogginginc is offline Junior Member
    Join Date
    Mar 2011
    Posts
    13

    Default

    wonder if anyone will ever try to read your thread full of copy paste spam.

  4. #4
    jhonmark555 is offline Senior Member
    Join Date
    Mar 2011
    Posts
    109

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. DKs experiment - Do search engines index flash files
    By temi in forum SEO Experiments
    Replies: 11
    Last Post: 12-13-2007, 06:10 AM
  2. NO index page
    By mr_bill in forum Boss Cart Installation Help
    Replies: 2
    Last Post: 09-17-2007, 01:27 PM
  3. Index page not in google.
    By OldWelshGuy in forum General Webmaster Talk
    Replies: 2
    Last Post: 06-28-2007, 11:11 AM
  4. Yahoo makes index changes
    By ealex in forum General Search Engine Discussions
    Replies: 1
    Last Post: 11-05-2005, 07:47 AM
  5. Replies: 2
    Last Post: 04-10-2005, 04:07 PM

Bookmarks

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124