In the previous sections I analyzed webpages from a high level and ignored many details. In this section and the next I dig deeper and get more detail oriented. Roll up your SEO sleeves, things are about to get messy.
The Importance of Good Site Architecture
Before you start examining a website from this level, let me explain the importance of good site architecture.
While writing this book I am working with a large client that is totally befuddled by its poor rankings. (Note: This client had me sign a nasty looking non-disclosure agreement, so I am unable to reveal its name.) The company’s homepage is literally one of the most linked-to pages on the entire Internet and at one point had the elusive PageRank 10. One of its current strategies is to leverage its homepage’s link popularity to bolster a large group of pages optimized for ultra competitive keywords. It wants to cast a wide net with the optimized pages and drive a large amount of search engine–referred traffic to its product pages.
It is a great idea, but with the current execution, it has no chance of working.
The problem is that the website lacks any kind of traditional site architecture. The link juice (ranking power) coming from the hundreds of thousands of domains that link to this company’s homepage has no way of traveling to the other webpages on this domain. All of the link juice is essentially bottled up at the front door.
Its content is located on at least 20 different domains, and there is no global navigation that leads users or search engines from the homepage down to categorized pages. The company’s online presence is more like a thousand islands rather than the super continent it could be. It is an enormous waste of resources and is directly affecting the company’s bottom line in a real way.
When explaining site architecture to clients, I start out by asking them to visualize a website like an ant hill. All of the chambers are like webpages and the tunnels are like internal links. I then have them imagine a little boy pouring water into the ant hill. He pours it down the main entrance and wants to have it fill all of the chambers. (As a side note, scientists actually have done this with cement to study the structure of ant metropolises. In one case, they had to pour 10 tons of liquid cement into an ant hill before it filled all of the chambers.) In this analogy the water represents the flow of link juice to webpages. As discussed earlier, this link juice (popularity) is essential for rankings.
The optimal structure for a website (or ant hill, if you must) would look similar to a pyramid .
This structure allows the most possible juice to get to all of the website’s pages with the fewest number of links. This means that every page on the website gets some ranking benefit from the homepage.
Optimal website structure
A pyramid structure for a website allows the most possible link juice to get to all the website’s pages with the fewest number of links.
NOTE Homepages are almost always the most linked-to pages on a domain. This is because they are the most convenient (the shortest) URL to link to when referring to the website online.
Evaluating Homepages
Now that we are on the same page about site architecture, we can move forward. Once I get to this level of analysis, I start really looking at the site architecture. Obviously, this starts at the homepage.
Ideally, the homepage should link to every single category of pages on a website. Normally, this is accomplished with a global navigation menu (global meaning it is on every web page on the domain). This is easy to do with small websites because if they have less than 150 pages, the homepage could directly link to all of them. (Note this is only a good idea if the homepage has enough links pointing at it to warrant this. Remember the little boy and the ant hill; link popularity is analogous to the amount of water the little boy has. If he doesn’t have enough, he can’t fill every chamber.) Following are some good and bad examples of this.
Evaluating Category Pages
If the website has more than 150 pages, it should divide its content into categories and link to those from the homepage. This is exactly what my former colleagues at SEOmoz look for on medium-sized websites. Ideally, these pages should serve both as a landing page for a searcher and a link juice router for search engines. Many webmasters mistakenly focus on one of these aspects while ignoring the other. As an SEO, part of your job will be making sure that both of these kinds of visitors are taken care of.
A website with more than 150 pages should divide its content into categories that are useful to both humans and search engines alike. 
Multiple Pathways to Content
Well-architected sites can get you to specific pages through multiple pathways. For example, sites like Netflix or IMDb.com are models of cross-linking efficiency. Searching vertically through “Actors,” you’ll find the page of Zooey Deschanel, which will link to pages for
Joseph Gordon-Levitt and (500) Days of Summer. Similarly, searching through romantic comedies will get you to (500) Days of Summer, which are only a click away from the two actors’ pages. And so on.
A good site gets you to deep URLs through a logical path with minimal clicks. A great site gets you to deep URLs through any one of several logical paths, and the paths frequently cross throughout the journey. The difference between good and great means more rapid, more thorough indexing; a more appropriate distribution of page authority and authority; and more qualified traffic.
Here is an example where the website is optimized for engines but not human visitors. This is a problem for two reasons:
First, if a user encounters this page, it will be difficult for them to navigate, and they will likely return to the previous page. This is counterproductive for a navigation system.
Second, search engine engineers preach building websites for humans not search engines. This is a clue to the long-term strategies of the search engines. This example does not appear to be built for humans and, thus, isn’t a good long-term strategy for SEO.
It should be noted that this is not the main movies category page on Rotten Tomatoes, but it still serves a great example of what to avoid when recommending long-term SEO strategies. This page was built as a band-aid to make up for poor site architecture. It is far from optimal.
- A good category page should do all of the following: Be useful for the user
- Direct link juice to all applicable subcategories
- Have enough unique content to be indexed by the search engines
Evaluating Subcategory Pages
If a website is very large it will need to break its categories into subcategories and link to them from the category pages. The subcategory pages should be set up exactly like category pages. The only difference is that instead of linking to more subcategory pages, they should link directly to content pages. Keep in mind that they have less link juice to pass than category pages (because they are more links away from the homepage), so subcategory pages should contain as few links as possible. This is because the amount of link juice a link passes is determined by both the link popularity of the given page and the number of links it contains. Similarly to category pages, subcategory pages should do the following:
- Be useful for the user
- Direct link juice to all applicable content pages
Have enough unique content to be indexed by the search engines
Pagination Solutions for Multiple of Content Pages
What if your subcategory has 100 different products inside it? How should the link architecture look? “Information Hierarchy”—the process of configuring pages so that product and content pages get crawled effectively in large quantities—is one the most challenging aspects of SEO architecture work, and it requires a delicate balance of crawlability and usability concerns.
Having, for example, 100 products within one subcategory can be a challenge, and there are several techniques that can work by themselves or in combination to aid in the crawling and indexing processes for this much content.
First, you can start with the assumption that having all 100 products on the subcategory page creates a negative user experience, and that your preferred number of products per page is closer to 10. If that’s the case, consider some of these options:
Think outside your preferred comfort zone. For some reason, product pages often default to a very small number of results (or products), and I think this is due to bandwidth restrictions that existed long ago. If the links to product pages aren’t particularly image-heavy, consider returning 20, 30, or even 50 products. If the page loads quickly, users don’t mind scrolling vertically.
Create sub-subcategories. Creating five sub-subcategories and linking to all of them from the subcategory page is really no different than linking to five separate content results pages. And it can help, especially if there is appreciable keyword demand behind the terms in the sub-subcategory.
Link to multiple pages showing limited numbers of links to content pages, but canonicalize to a “see all” version. Have you ever seen a subcategory page that links to dozens or hundreds of different content pages by linking to the “next” page at the bottom of the page? This type of “chain” navigation is just about the least effective solution possible for getting hundreds of content pages indexed. Instead, offer users and engines a link to “see all,” which links to a page showing all products (and thus linking to all content pages). Your navigation can still link to smaller pages showing fewer product links, but those smaller pages should all canonically point to the “see all” version.
Evaluating Content Pages
Content pages are the meat of websites. They are the reason visitors came to the site, and just like a good breakfast, these pages should leave those visitors feeling fulfilled and smelling of delicious bacon. (I made up that last part, but if a website really did smell like bacon, I would surely link to it.) The pages should be very specific to a given topic (usually a product or an object) and be hyper-relevant.
As an SEO you should be looking to see if the purpose of the page is directly stated in all of the following areas:
Title tag
URL
Content of page
Images
NOTE Good content pages act as link magnets. They are much more likely to receive links than subcategory and category pages. Smart SEOs use this as an advantage and have content pages link back to their applicable category and subcategory pages. This then increases the amount of juice flowing to all of the content pages on the website and makes them all rank better.
Good Example of a Content Page
The content page in this figure is good for a couple of reasons. First the content itself is unique on the Internet (which makes it worthwhile for search engines to rank well) and covers unique content in a lot of depth. If you have a question about Super Mario World, there is a good chance that this page will be able to answer your question.
Aside from content, this page is well laid out. the topic of the page is stated in the title tag (Super Mario World – Wikipedia, the free encyclopedia); the URL (http://en.wikipedia.org/wiki/Super_Mario_World), the page’s content (Notice the page heading, “Super Mario World”); and again within the alt text of the images on the page.
Imagine for a second you are a search engine engineer working at one of the main search engines. Your task is to organize the information on the Web and make it universally accessible. The problem is that the information on the Internet is not formatted in any specific format. This makes it incredibly difficult to write code that can read all of the information on the Internet, much less organize it. The page uses multiple elements to describe what it is about. These clues are indispensible. By relying on these, you find it much easier to write code that understands what the page is about. Combine this with the fact that this page is part of a trusted and well linked to resource on the Web and it is easy to see why search engineers write code to make this example rank highly for relevant queries.
Bad Example of a Content Page
In contrast, a much less effective example of a content page. Notice how it differs from the first example.
This figure shows a less search engine–friendly example of a content page that is targeting the term Super Mario World. While the subject of the page is present in some of the important elements of the webpage (title tag and images), the content is less robust than the Wikipedia example, and the relevant copy on the page is less helpful to a reader.
Notice that the description of the game is suspiciously similar to copy written by a marketing department: “Mario’s off on his biggest adventure ever, and this time he’s brought along a friend.” That is not the language that searchers query for, and it is not the type of message that is likely to answer a searcher’s query. Compare this to the first sentence of the Wikipedia example, “Super Mario World is a platform game developed and published by Nintendo as a pack-in launch title for the Super Nintendo Entertainment System.” In the GameFAQs example, all that is established by the first sentence is that someone or something named Mario is on an adventure that is bigger than his or her last (how do you quantify that?) and he or she is accompanied by an unnamed friend. On the other hand, the Wikipedia sentence tells the reader that Super Mario World is a game developed and published by Nintendo for the gaming system Super Nintendo Entertainment System.
If you were a search engineer, which example would you want to rank higher for queries related to this game? Search results show that Bing and Google engineers think the Wikipedia example should rank better.
An Ideal Content Page
An ideal content page should do all of the following:
Be hyper-relevant to a specific topic (usually a product or single object) Include subject in title tag
Include subject in URL
Include subject in image alt text
Specify subject several times throughout text content
Provide unique content about a given subject
Link back to its category page
Link back to its subcategory page
Link back to its homepage (normally accomplished with an image link showing the website logo on the top left of a page)
Evaluating URL Structure
Along with smart internal linking, SEOs should make sure that the category hierarchy is reflected in URLs.
Take a look at the following good example of URL structure:
NOTE Remember to consider case when you’re considering URL structure. Different platforms capitalize terms in different ways.
Remember that any case style is fine (lowercase, mixed case, and so on), but that whatever method you choose, you must stick to it and watch for unwanted variants being indexed. Two URLs with the same characters—but different case styles—are considered two distinct
URLs by engines.
This URL is effective because it clearly shows the hierarchy of the information on the page (history as it pertains to video games in the context of games in general). This information is used to determine the relevancy of a given webpage by the search engines. Using the hierarchy, the engines can deduce that the page likely doesn’t pertain to history in general but rather to that of the history of video games. This makes it an ideal candidate for search results related to video game history. Engines can speculate on all of this information without even needing to process the content on the page.
An effective URL structure can help search engines understand how useful or relevant a given webpage is.
Now take a look at the following example of URL structure:
Unlike the first example, this URL does not reflect the information hierarchy of the website. You can see that the given page relates to titles and is on the IMDb website, but you cannot determine what the page is about. The reference to tt0468569 does not directly imply anything that a web surfer is likely to search for. This means that the information provided by the URL is of very little value to search engines.
If you were a search engineer, which page would want to be included at the top of a search results page? The answer, of course, depends on the content and link profile of the given page, but instead of the URL supplementing this information, it is adding nothing.
NOTE The IMDb example is interesting because the URL ranks so well for “the dark knight” despite its URL structure. One of the reasons this site is so authoritative is because its movie-specific pages begin to accrue links well before the release date of films, and this head start is able to overcome less significant factors like semantic signals from the URL itself.
Still, true optimization requires examining all available avenues of improvement and weighing the benefit of implementing them. Could the IMDb page rank higher than the movie’s microsite itself if the URL were simplified? And how would a more semantically clean URL affect click-through? Certainly not negatively.
URL structure is important because it helps the search engines to understand the relative significance of and adds a useful relevancy metric to the given page. It is also helpful for links because people are more likely to link with the relevant anchor text if the keywords are included in the URL.
Action Checklist
When you are viewing a website from the 10-foot level, be sure to check for and note all of the following: Homepage links to every category of pages on the website
The ability of category pages to help the user and the search engines
The presence of links on category pages to all applicable subcategories (if the given amount of page link juice can sustain it)
The ability of subcategory pages to help the user and the search engines The presence of links on subcategory pages to all applicable content pages The relevancy of the given content pages to the given topic
The ability of the URL structure to match category hierarchy and supplement relevancy
In this section I discussed what to look for when analyzing site architecture. I covered the different aspects that are important for category, subcategory, and content pages, and I included a lot of graphical examples (because picture books are easier to read than text-based books).
In the following section I want to dive down to the lowest level of webpages and examine individual pieces of content. If text is the currency of the Internet, consider the pieces of content to be dollar bills. Your mother was mostly right: money does not grow on trees, but it does grow on the Internet.



