Creating Pages That Search Engines Love - Building Search Engine-Friendly Sites - SEO For Dummies, 6th Edition (2016)

SEO For Dummies, 6th Edition (2016)

Part II. Building Search Engine-Friendly Sites

Chapter 7. Creating Pages That Search Engines Love

In This Chapter

arrow Getting your site read

arrow Understanding the importance of mobile-friendly sites

arrow Knowing what search engines see

arrow Exploring keyword concepts

arrow Creating Web pages

In this chapter, you find out how to create Web pages that search engines really like — pages that can be read and indexed and that put your best foot forward. Before you begin creating pages, I recommend that you read not only this chapter but also Chapter 9 to find out how to avoid things that search engines hate. There are a lot of ways to make a Web site work, and just as many ways to break it, too. Before you get started creating your pages, you should be aware of the problems you may face and what you can do to avoid them.

remember I’m assuming that you or your Web designer understand HTML and can create Web pages. I focus on the most important search engine–related things you need to know while creating your pages. It’s beyond the scope of this book to cover basic HTML and Cascading Style Sheets. On the other hand, many content-management systems (CMS) do a really good job of optimizing for the search engines. These are automated Web-page-creation systems, such as WordPress, Drupal, and Joomla. But there are so many of these systems, I can’t possibly cover that particular subject.

Thus I talk in terms of HTML; you’ll have to translate into the particular features and options of the CMS you happen to be using. For instance, I’m going to discuss how you can put keywords into <IMG> tags using the ALT attribute; if you’re using a CMS, you’ll need to figure out how that particular system allows you to add that text when inserting an image (if it does, and most these days do).

Preparing Your Site

When you’re creating a Web site, the first thing to consider is where to put your site. By that, I mean the Web server and the domain name.

Finding a hosting company

Although many large companies place their Web sites on their own Web servers, most small companies don’t. They shouldn’t do so, in fact, because there’s simply no way a small business can manage a Web server anywhere near as cheaply and reliably as a good hosting company can do it. Rather, a hosting company rents space on its servers to other businesses.

You need to consider many issues when selecting a hosting company, most of which aren’t directly related to the search engine issue. From an SEO perspective, you need to be able to upload pages directly to your site, as most hosts allow, or use a search-engine-friendly CMS system that either the hosting company provides or you install. I can’t help much with that; you’ll have to do some research on the system you are considering to see whether it’s truly search-engine friendly (see Part V Web Extra, Ten Ways to Make WordPress (and Others) Search Engine Friendly).

As you learned in Chapter 3, it’s also important that the hosting company you use is reliable. The servers must work virtually all the time and deliver pages quickly. Slow, unreliable servers can hurt your search-engine rankings. Most hosting companies these days are reasonably reliable, but not all.

Picking a domain name

Search engines read uniform resource locators (URLs), looking for keywords in them. For instance, if you have a Web site with the domain name rodent-racing.com and someone searches Google for rodent racing, Google sees rodent-racing as a match; because a dash appears between the two words, Google recognizes the individual words in the domain name. If, however, you run the words together (rodentracing), Google doesn’t regard the individual words as individual words; it sees them as part of the same word. That’s not to say that Google can’t find text within words — it can, and you sometimes see words on the search results pages partially bolded when Google does just that — but when ranking the page, Google doesn’t regard the word it found inside another word as the same as finding the word itself.

tip To see this concept in action, use the allinurl: search syntax at Google. Type allinurl:rodent, for example, and Google finds URLs that contain the word rodent (including the directory names and filenames).

So, putting keywords into the domain name and separating keywords with dashes provides a small benefit. Another advantage to adding dashes between words is that you can relatively easily come up with a domain name that’s not already taken. Although it may seem as though most of the good names were taken long ago, you can often come up with some kind of keyword phrase, separated with dashes, that’s still available. Furthermore, search engines don’t care which first-level domain you use; you can use .com, .net, .biz, .tv, or whatever; it doesn’t matter.

remember Now, having said all that, let me tell you my philosophy regarding domain names. In the search engine optimization field, it used to be very popular to use dashes and keywords in domain names, but in most cases, the lift provided by keywords in domain names is relatively small; in fact, while the search engines clearly read keywords in domain names, they give much less value to them than they have in the past. Rather, you should consider other, more important factors when choosing a domain name:

· A domain name should be short, easy to spell, and easy to remember. It should also pass the radio test. Imagine that you’re being interviewed on the radio and want to tell listeners your URL. You want something that’s instantly understandable without having to be spelled. You don’t want to have to say “rodent dash racing dash events dot com”; it’s better to say “rodent racing events dot com.”

· In almost all cases, you should get the .com version of a domain name. If the .com version is taken, do not try to use the .net or .org version for branding purposes! People remember .com, even if you say .org or .net or whatever. So, if you’re planning to promote your Web site in print, on the radio, on TV, on billboards, and so on, you need the .com version.

technicalstuffDomain confusion

Several years ago, I recommended that a nonprofit client of mine register the .com version of its domain name. (The client had been using .org for years.) During the few hours that the .com domain was not yet pointing to its site, but instead was pointing to the domain registrar’s site, the nonprofit received several calls from people trying to get to its Web site. These people wanted to let the company know that something was wrong with its server because its domain name was pointing to the wrong place. For years, the company printed all its materials using .org; it had never printed anything with .com because it didn’t own that domain name; however, people were still trying to get to the .com version, a perfect example of what I call domain confusion.

A classic example is a situation involving Rent.com and Rent.net. These two different Web sites were owned by two different companies. Rent.net spent millions of dollars on advertising; every time I saw a Rent.net ad on a bus, I had to wonder how much of the traffic generated by these ads actually went to Rent.com! (The Rent.net domain name is no longer in use — it now points to Move.com — and Rent.com is. I don’t think that’s a coincidence!)

Are keyworded domain names worth the trouble? Because the lift provided by keywords in the domain name may be rather small — and, in fact, putting too many keywords into a name can hurt your placement — you should probably focus on a single, brandable domain name (a .comversion).

warning Be careful about using a domain-forwarding service for Web sites that you want to turn up in search engines. Many domain registrars allow you to simply forward browsers to a particular site: A user types www.domain1.com, and the registrar forwards the browser towww.domain2.com, for instance. Such forwarding systems may use frames (discussed in Chapter 9), which means that search engines don’t index the site properly, or do not use 301 redirects to implement the forward as they should for SEO purposes (see Chapter 24 for more information on 301 redirects).

Mobile readiness and “Mobilegeddon”

In 2015, Google introduced another important concept for Webmasters to worry about: mobile readiness. By this point, a huge proportion of Web site visits, for a wide range of site types, were coming from mobile devices. For example, when I checked the site analytics for a heavy industrial client of mine in early 2015, I found that 29 percent of the traffic came from small mobile devices (smart phones), with another 13 percent from tablets. It isn’t just the younger Internet users or the more technically adept that are moving to mobile. Across the board, a huge proportion of site visits now comes from mobile devices.

However, some sites see much higher percentages. Google has said that it expects overall search queries on mobile devices to exceed desktop searches (within the United States) — perhaps by the time you read this.

That’s been true for some time, but what put the SEO world in a tizzy was when Google started talking about how it was going to start taking into consideration how a site would display on a mobile device, and factor that into ranking for when a search was carried out from such a device. (In an example of typical SEO-world drama, this change was labeled by many as Mobilegeddon.)

That is, if your site doesn’t display well on a smart phone, it might not rank as well when someone searches from a smart phone. (Not surprisingly, there’s no effect on regular-size devices; in fact, laptops aren’t considered mobile devices from this perspective, and tablets are explicitly excluded by Google from this issue — it’s the smart phone searches that are the SEO problem).

So, what’s it mean to you? If you’re about to build a new site, you really should plan to build a mobile-friendly site; that’s easier than you might imagine, as many recent versions of CMS tools can do this for you automatically. If you already have a site … well, that’s a problem; it might take a lot of work to rebuild.

However, most Web sites aren’t mobile friendly; thus, many of your competitors probably don’t have mobile friendly sites, limiting the “damage.” Long term, this could cause your business serious problems.

There are essentially three ways to make a site mobile friendly:

· Responsive Design: This uses sophisticated HTML to automatically change the layout of a site’s Web pages to suit the particular device on which it is being displayed. It’s probably the most popular method these days.

tip Google recommends responsive design.

· Dynamic Serving: With this method, you have two versions of your Web site: one for smart phones, and another for larger devices. The Web server decides which version to send when a page is requested.

· Separate URLs: In this case, you have two versions of your site. The server directs the user to the most appropriate site, on a different URL, depending on the device.

If I go any further, it will be way beyond the scope of this book, but here are a couple of pages to show you more:

· Google’s information on making your site mobile friendly:

https://developers.google.com/webmasters/mobile-sites

· Google’s test tool; enter a Web page URL and Google will tell you whether it’s mobile friendly:

https://www.google.com/webmasters/tools/mobile-friendly

HTTPS

Google is now encouraging Web sites to go HTTPS. HTTPS means HyperText Transfer Protocol over SSL (or over TLS), or maybe HyperText Transfer Protocol Secure. Let’s not get too geeky here; essentially, when your Web browser uses HTTPS to communicate with a Web server, the communication is encrypted. Believe it or not, even the NSA can’t read the communications if they’re intercepted. It’s HTTPS that’s being used by ecommerce sites when you send your credit-card information to make a purchase; you’ve probably seen https:// for a URL in the browser Location bar, or a little lock icon indicating the user of HTTPS. (In fact, it isn’t all about encryption of communications, it’s also about authentication and data integrity; I’ll leave you delve into the details if you’re interested.)

In 2014, Google launched its HTTPS Everywhere program, a program aimed at getting the entire Web to go HTTPS. In other words, not just financial transactions should use HTTPS, but everything going across the Web should use it. I won’t go into detail about why; you can search forhttp everywhere to learn more.

However, as part of this program, Google has decided to start using whether or not a page is being delivered across an HTTPS connection as part of the ranking algorithm. Before you get worried, right now it doesn’t count for much. Here’s what Google said when it first announced this:

…we’re starting to use HTTPS as a ranking signal. For now it’s only a very lightweight signal — affecting fewer than 1% of global queries, and carrying less weight than other signals such as high-quality content — while we give webmasters time to switch to HTTPS. But over time, we may decide to strengthen it, because we’d like to encourage all website owners to switch from HTTP to HTTPS to keep everyone safe on the web.

The writing’s on the wall. Anyone beginning a new Web site probably should take this into consideration from the start, and owners of existing sites might want to consider adding HTTPS soon. It isn’t hugely complicated, but it’s a bit geeky, and beyond the scope of this book. You can find more information here:

https://support.google.com/webmasters/answer/6073543

Seeing Through a Search Engine’s Eyes

What a search engine sees when it loads one of your pages isn’t always the same as what your browser sees. To understand why, you need to understand how a Web page is created. See Figure 7-1 and read this quick explanation:

1. A user types a URL into his browser or clicks a link, causing the browser to send a message to the Web server asking for a particular page.

2. The Web server grabs the page and quickly reads it to see whether it needs to do anything to the page before sending it.

3. The Web server compiles the page, if necessary.

In some cases, the Web server may have to run ASP or PHP scripts, for instance, or it may have to find an SSI (server-side include), an instruction telling it to grab something from another page and insert it into the one it’s about to send.

4. After the server has completed any instructions, it sends the page to the browser.

5. When the browser receives the page, it reads through the page looking for instructions and then, if necessary, further compiles the page.

6. When the browser is finished, it displays the page for the user to read.

image

Figure 7-1: How Web pages are created.

Here are a few examples of instructions the browser may find inside the file:

· <SCRIPT> tags containing JavaScript scripts or references to scripts stored in other files: The browser then runs those scripts.

· Cascading Style Sheets (CSS): These instructions tell the browser how the page — in particular, the text on the page — should be formatted.

· References to images or other forms of media: After the browser finds these references, it pulls them into the page.

So that’s what happens normally when Web pages are created. But what about searchbots, the programs used by search engines to index pages? Well, in the past, they worked differently. When a searchbot requested a page, the server did what it would normally do — construct the page according to instructions and send it to the searchbot. But the searchbot didn’t follow all the instructions in the page — it just read the page. For example, it wouldn’t run scripts in the page, and it wouldn’t use the Cascading Style Sheet information to format the page.

technicalstuffScripting

ASP and PHP scripts are little programs that are written into Web pages. The scripts are read by a program working in association with the Web server when a page is requested. The searchbots see the results of these scripts because the scripts have been run by the time the Web server sends the page. Server-side includes (SSIs) are simple statements placed into the HTML pages that name another file and, in effect, say to the Web server, “Grab the information in this file and drop it into the Web page here.” Again, the searchbots see the information in the SSI because the Web server inserts the information before sending the Web page.

Thus, you can use two kinds of instructions to build Web pages:

· Server-side instructions: These instructions, such as ASP and PHP scripts and SSI instructions, are carried out by the server before sending the information to the searchbots.

· Browser-side (or client-side) instructions: These instructions are embedded into the Web page and, in the past, generally ignored by searchbots. For instance, if you created a page with a navigation system built with JavaScript, the search engines may not be able to read it. Some people even used browser-side instructions to intentionally hide things from search engines.

But things are different today; the only question is how much. Google, for instance, now does read browser-side scripting. It can read JavaScript, for instance, allowing it to read content inserted into pages using scripts. For example, it can read Facebook page content, much of which is inserted using JavaScript browser side. Today’s searchbots are much more sophisticated than in the past. But that doesn’t necessarily mean they’ll read everything inserted browser side. There’s an almost limitless variety of ways to use scripts and stylesheets; it’s likely that the searchbots decipher some things and not others, and some search engines do a better job of seeing the results of browser-side scripts than others.

My belief is that if you want to make sure that something is definitely seen by the search engines, place it into the page server side. Yes, Google can read Facebook pages, but that doesn’t necessarily mean it’ll spend the time to figure out the scripts on your pages.

remember These concepts are very important:

· Server side = visible to searchbots

· Browser side = sometimes not visible to searchbots

Understanding Keyword Concepts

Here’s the basic concept of using keywords: You put keywords into your Web pages in such a manner that search engines can find them, read them, and regard them as significant.

Your keyword list is probably very long, perhaps hundreds of keywords, so you need to pick a few to work with. (If you haven’t yet developed a keyword list, refer to Chapter 6 for details.) The keywords you pick will be either

· Words near the top of the list that have many searches.

· Words lower on the list — in the “long tail — that may be worth targeting because you have relatively few competitors for those keywords.

tip It’s often easy to create pages that rank well for the keywords at the bottom of your list because they’re unusual terms that don’t appear in many Web pages. However, they’re at the bottom of your list because people don’t often search for them! Therefore, you have to decide whether it’s worthwhile to rank well on a search term that’s searched for only once or twice a month.

Picking one or two phrases per page

You optimize each page for one or two keyword phrases. By optimize, I mean that you create the page in such a manner that it has a good chance of ranking well for the chosen keyword phrase or phrases when someone uses them in a search engine.

remember You can’t optimize a page well for more than one keyword phrase at a time. For example, the <TITLE> tag is one of the most important components on a Web page, and the best position for a keyword is at the beginning of that tag. Clearly, you can place only one phrase at the beginning of the tag. (However, sometimes, as you find out in Chapter 6, you can combine keyword phrases — for example, optimizing for rodent racing scores also, in effect, optimizes for rodent racing.)

Have primary and secondary keyword phrases in mind for each page you’re creating, but also consider all the keywords you’re interested in working into the pages. For instance, you might create a page that you plan to optimize for the phrase rodent racing, but you also have several other keywords that you want to scatter around your site: rodent racing scores, handicap, gerbil, rodentia, furry friend events, and so on. Typically, you pick one main phrase for each page and incorporate the other keyword phrases throughout the page, where appropriate.

tip Place your keyword list into a word processor, enlarge the font, and then print the list and tape it to the wall. Occasionally, while creating your pages, glance at the list to remind yourself which words you need to weave into your pages.

Checking prominence

The term prominence refers to where the keyword appears — how prominent it is within a page component (the body text, the <TITLE> tag, and so on). A word near the top of the page is more prominent than one near the bottom; a word at the beginning of a <TITLE> tag is more prominent than one at the end; a word at the beginning of the DESCRIPTION meta tag is more prominent than one at the end; and so on.

Prominence is good. If you’re creating a page with a particular keyword or keyword phrase in mind, make that term prominent — in the body text, in the <TITLE> tag, in the DESCRIPTION meta tag, and elsewhere — to convey to search engines that the keyword phrase is important in this particular page. Consider this title tag:

<TITLE>Everything about Rodents - Looking after Them, Feeding Them, Rodent Racing, and More.</TITLE>

When you read this tag, you can see that Rodent Racing is just one of several terms the page is related to. The search engine comes to the same conclusion because the term is near the end of the title, meaning that it’s probably not the predominant term. But what about the following tag?

<TITLE>Rodent Racing - Looking after Your Rodents, Feeding Them, Everything You Need to Know</TITLE>

Placing Rodent Racing at the beginning of the tag places the stress on that concept; search engines are likely to conclude that the page is mainly about rodent racing.

Watching density

Another important concept is keyword density. When a user searches for a keyword phrase, the search engine looks at all pages that contain the phrase and checks the density — the ratio of the search phrase to the total number of words in the page.

Suppose that you search for rodent racing and the search engine finds a page that contains 400 words, with the phrase rodent racing appearing 10 times — that’s a total of 20 words. Because 20 is 5 percent of 400, the keyword density is 5 percent.

Keyword density is important, but you can overdo it. If the search engine finds that the search phrase makes up 50 percent of the words in the page, it may decide that the page was created purely to grab the search engine’s attention for that phrase and thus decide to ignore it. On the other hand, if the density is too low, you risk having the search engines regard other pages as more relevant for the search.

You can get hung up on keyword density, and some people use special tools to check the density on every page. This strategy can be very time consuming, especially for large sites. You’re probably better off eyeballing the density in most cases.

tip Here’s my general rule: If the phrase for which you’re optimizing appears an awful lot, you’ve overdone it. If the text sounds clumsy because of the repetition, you’ve overdone it.

Placing keywords throughout your site

Suppose that someone searches for rodent racing, and the search engine finds two sites that use the term. One site has a single page in which the term occurs; the other site has dozens of pages containing the term. Which site does the search engine think is most relevant? The one that has many pages related to the subject, of course.

In many cases, you’re not likely to grab a top position by simply creating a single page optimized for the keyword phrase. You may need dozens, perhaps hundreds, of pages to grab the search engines’ attention (with plenty of links between pages and from other sites back to yours — which you find out about in Chapters 16 through 18).

Creating Your Web Pages

remember When you’re creating your Web pages, you need to focus on two essential elements:

· The underlying structure of the pages

· The text you plunk down on the pages

The next sections fill you in on what you need to look out for.

Naming files

Search engines get clues about the nature of a site from its domain name as well as from the site’s directory and file structure. The added lift is probably not large, but every little bit counts, right? You might as well name directories, Web pages, and images by using keywords.

For example, rather than create a directory named /events/, you could name it /rodent-racing-events/. Rather than have a file named gb123.jpg, you can use a more descriptive name, such as rodent-racing-scores.jpg. Don’t have too many dashes in the file and directory names, though, because overdoing it may cause search engines to ignore the name.

remember You should separate keywords in a name with dashes, but not with underscores, despite what your Web designer may tell you. In the olden days, Google regarded underscores as part of the word, and dashes as separators (as a space, in effect; for obscure technical reasons, you should never put actual spaces in folder and filenames).

Years ago, rumors spread around the SEO community that Google had changed this behavior, and that it now treated underscores and dashes as the same thing. However, the last really reliable comment on this issue that I have been able to find (from Matt Cutts; see Chapter 23) is that Google never actually made this change, though they had considered it. (Ironically, it was a perhaps misinterpreted comment from Matt that started the rumor.)

So it seems that Google still (most likely) regards rodent_racing, for instance, as a single word, but rodent-racing as two words. Other search engines may operate differently; for instance, Bing probably treats both symbols the same, and likely always has. Thus, I still recommend dashes — hyphens — for several reasons:

· It’s what Google recommends, and if it’s good enough for Google, it’s good enough for me.

· Some search engines may still have problems with underscores; most likely, Google still does.

· There was never any real reason for using underscores anyway, despite what many geeks seemed to think.

It isn’t a huge deal, but in the world of SEO, every little optimization element counts. So don’t let anyone tell you that you should be using underscores rather than dashes in file and directory names — it’s simply not true!

Creating directory structure

First, let me explain a common belief in the SEO business: It may be a good idea to keep a flat directory structure in your Web site — that is, keep your pages as close to the root domain as possible, rather than have a complicated multilevel directory tree. Create a directory for each navigation tab and keep all the files in that directory.

Many observers believe that search engines downgrade pages that are lower in the directory structure. This effect is probably small, but the theory is that you’re better off using a structure with fewer sublevels than with more. For instance, the first page that follows, according to this theory, would be weighted more highly than the second page:

http://www.domainname.com/page.html

http://www.domainname.com/dir1/dir2/dir3/dir4/page.html

However, a flat directory structure is probably not terribly important. Matt Cutts of Google (see Chapter 23) claims that the directory structure doesn’t matter to Google, the single most important search engine, of course. And sometimes it’s nice to use a directory structure to add a few keywords to the URL to tell the search engines what the page is about. For instance, you might have the following in a real estate site:

http://www.domain.com/real-estate/colorado-homes-for-sale/denver-county/lakewood.html

Don’t use too many hyphens, though; a few here and there are okay (I have three in the second directory name), but overdoing it might cause problems.

Viewing TITLE tags

Most search engines use the site’s <TITLE> tag as the link and main title of the site’s listing on the search results page, as shown in Figure 7-2.

image

Figure 7-2: Search results from Google, showing where components come from.

<TITLE> tags tell a browser what text to display in the browser’s title bar and are very important for search engines as well. Searchbots read page titles and use that information to determine what the pages are about. If your <TITLE> tags have a keyword between them, search engines naturally assume that the word must have something to do with the content or purpose of the Web page.

remember The <TITLE> tag is one of the most important components as far as search engines are concerned. However, these tags are often wasted because so many sites don’t bother placing useful keywords in them. Titles are often generic: Welcome to Acme, Inc., or Acme Inc. – Home Page. Such titles are not beneficial for search engine optimization.

I searched at Google for intitle:welcome to find out how many pages have the word welcome in their <TITLE> tags. The result? Around 198 million! Around 125 million have welcome to in the title (allintitle:”welcome to”). Interestingly, these numbers are far lower than they were a few years ago, perhaps indicating evolving ideas related to Web design and SEO.

Having Welcome to as the first words in your title is a waste of space — only slightly more wasteful than your company name! Give the search engines a really strong clue about your site’s content by using a keyword phrase in the <TITLE> tags. Here’s how:

1. Place your <TITLE> tags immediately below the <HEAD> tag.

2. Place 40 to 60 characters between the <TITLE> and </TITLE> tags, including spaces.

3. Put the keyword phrase you want to focus on for this page at the beginning of the <TITLE> tag.

If you want, you can repeat the primary keywords once. Limit the number of two-letter words and very common words (known as stop words), such as as, the, and a, because search engines ignore them.

Here’s a sample <TITLE> tag:

<TITLE>Rodent Racing Info. Rats, Mice, Gerbils, Stoats, All Kinds of Rodent Racing</TITLE>

(By the way, I am aware that a stoat is not, strictly speaking, a rodent, but it looks quite rodenty to me, and I’m sure someone races them.)

Although the title tag is used as the link on the search-results page, it will be truncated if it’s too long. On Google, for instance, depending on spacing between words, the title may be truncated to around 50 to 55 characters and have an ellipsis (…) displayed at the end of the line — that is,Rodent Racing Info. Rats, Mice, Gerbils, Stoats, All Kinds of Rodent Racing may be displayed as Rodent Racing Info. Rats, Mice, Gerbils, Stoats, All… — so make sure you get the important words that you want to be visible on the search-results page into those first 55 or so characters.

Also, use title case (capitalizing the initial letter of each word, as in Rodent Racing Info. Rats, Mice, Gerbils, Stoats, All Kinds of Rodent Racing, not Rodent racing info. rats, mice, gerbils, stoats, all kinds of rodent racing). Title case is much easier for people to read as they scan down the page.

tip The <TITLE> tag and often the DESCRIPTION tag (explained in the next section) appear on the search results page, so both these tags should contain text designed to encourage people to visit your site.

Using the DESCRIPTION meta tag

Meta tags are special HTML tags that you can use to carry information, which browsers or other programs can then read. When Internet search engines were first created, Webmasters included meta tags in their pages to make it easy for search engines to determine what the pages were about. Search engines also used these meta tags when deciding how to rank the page for different keywords.

The DESCRIPTION meta tag describes the Web page to the search engines. Search engines use this meta tag in two ways:

· They read and index the text in the tag and, in some circumstances, use it to figure out a page’s relevance to a search term. (Google claims it doesn’t use the tag for page ranking, but other search engines may.)

· In many cases, they use the text verbatim in the search results page. That is, if your Web page is returned in the search results page, the search engine may grab the text from the DESCRIPTION tag and place it under the text from the <TITLE> tag so that the searcher can read your description.

Now, this process can vary between search engines, and over time for the same search engine. Until sometime in 2007, in most cases, Google didn’t use the text from the DESCRIPTION meta tag in its search results page. Rather, Google grabbed a block of text where it found the search keywords on the page and then used that text in the results page. However, these days it will often use the DESCRIPTION tag text if it finds the searched-for words in the description. If it doesn’t, or if it finds only some of the searched for words, it may grab text from within the page content and display that instead. The DESCRIPTION meta tag is pretty important, so you should definitely use it. Think of it as serving two purposes:

· It’s a page-ranking tool, helping, in some search engines, the page to rank higher on the search-results page.

· It’s a sales pitch, seen by people viewing the search engine’s search-results page; it should encourage people to click the link.

As is the <TITLE> tag, the DESCRIPTION may be truncated; in Google search results, the text will be truncated to around 150 characters. For instance, here’s the description-tag text from CNN.com:

Find the latest breaking news and information on the top stories, weather,
business, entertainment, politics, and more. For in-depth coverage, CNN provides special reports, video, audio, photo galleries, and interactive guides

And here’s how it appears in the Google search results:

Find the latest breaking news and information on the top stories, weather,
business, entertainment, politics, and more. For in-depth coverage, CNN provides
… .

So if you want part of the text to be seen, make sure it appears before that 150th-or-so character. You can have a longer description and get a few more keywords in there (though Google doesn’t use it for ranking, so you won’t be helping your position, at least with that search engine), but don’t make it too long; perhaps 220 or 230 characters maximum, including spaces (the preceding CNN description is 225). Place the DESCRIPTION tag immediately below the <TITLE></TITLE> tags tags. Here’s an example:

<META NAME="description" CONTENT="Rodent Racing - Scores, Schedules,
Everything Rodent Racing. Mouse Racing, Stoat Racing, Rat Racing, Gerbil Racing.
The Web's Top Rodent Racing Systems and Racing News">

As with the <TITLE> tag, title case is also a good thing; title case makes the DESCRIPTION text easier to read. Duplicating your most important keywords once is okay, but don’t overdo it, or you’ll upset the search engines. Don’t, for instance, do this:

<META NAME="description" CONTENT="Rodent Racing, Rodent Racing,
Rodent Racing, Rodent Racing, Rodent Racing, Rodent Racing, Rodent Racing,
Rodent Racing, Rodent Racing, Rodent Racing">

warning Overloading your DESCRIPTION (or any other page component) with the same keyword or keyword phrase is known as spamming (a term I hate, but hey, I don’t make the rules), and trying such tricks may get your page penalized; rather than help your page’s search engine position, it may cause search engines to omit it from their indexes.

I also recommend avoiding sentences in your Description tag; rather, try to get the important words in front of people, conveying important information very quickly. Users generally scan search-results pages, looking at each search result for no more than about a second before moving to the next. If you provide a sentence for them to dig through … they won’t. For instance, consider these two descriptions:

Enjoy the experience of finding clearance shoes, clothing, and accessories at
the very best prices. You’ll find fashions from Acme, athletic gear from Widget,
comfortable shoes from Thingamibob

Clearance Shoes, Clothing, Accessories – Very Best Prices – Fashions: Acme -
Athletic Gear: Widget - Comfortable Shoes: Thingamibob

These two pieces of text convey exactly the same information, but it’s much quicker and easier for the searcher to understand the second example than the first. You should consider your DESCRIPTION tag to be not only a search engine component but also a sales tool. Remember that much of the tag — perhaps the first 150 to 160 characters — will quite likely be seen in the search results, so you want to use text that encourages people to click the link — text that helps your links stand apart from the others on the page. An example is a compelling sales message or your phone number, which helps build credibility by ensuring that people recognize that it’s a real site and not some search engine spam result! (Using a phone number also has the effect of making your listing stand out a little; the eye “trips” over changes in the pattern, such as numbers and capitalized words.) You might also want to think about your site’s Unique Selling Proposition (USP). What makes your site special compared to others — a huge selection? Free shipping? Remember, this is a sales pitch to get people to click, so think about how you can do that.

Tapping into the KEYWORDS meta tag

The KEYWORDS meta tag was originally created as an indexing tool — that is, a way for the page author to tell search engines what the page is about by listing (yep) keywords. Although quite important many years past, this meta tag isn’t so important these days. Some search engines may use it, but many don’t, and even those that do use it don’t give it much value. (Google and Bing almost certainly don’t use it. Ask.com may, but again, it doesn’t give it much weight.) Still, you might as well include the KEYWORDS meta tag. You do have a list of keywords, after all.

Don’t worry too much about the tag — it’s not worth spending a lot of time over. Here are a few points to consider, though:

· Limit the tag to 10 to 12 words. Originally, the KEYWORDS tag could be very large, up to 1,000 characters. These days, many in the optimization business are wary of appearing to be spamming search engines by stuffing keywords into any page component, and thus recommend that you use short KEYWORDS tags.

· You can separate each keyword with a comma and a space. However, you don’t have to use both — you can have a comma and no space, or a space and no comma.

· Make sure that most of the keywords in the tag are also in the body text. If they aren’t, the tag probably won’t do you any good. Many people also use the KEYWORDS tag as a good place to stuff spelling mistakes that are commonly searched.

· Don’t use a lot of repetition. You shouldn’t do this, for instance: Rodent Racing, Rodent Racing, Rodent Racing, Rodent Racing, Rodent Racing, Rodent Racing, or even Rodent Racing, Rodent Racing Scores, Rodent Racing, Gerbils, Rodent Racing Scores, Rodent Racing… .

· Don’t use the same KEYWORD tag in all your pages. You can create a primary tag to use in your first page and then copy it to other pages and move terms around in the tag.

Here’s an example of a well-constructed KEYWORD tag:

<META NAME="keywords" CONTENT="rodent racing, racing rodents, gerbils, mice, mouse, raceing, mouse, rodent races, rat races, mouse races, stoat, stoat racing">

But again, don’t waste much time on this tag. It’s really not important at all.

Using other meta tags

What about other meta tags? Sometimes if you look at the source of a page, you see all sorts of meta tags, as shown in Figure 7-3 (in bold to make seeing them easier). Meta tags are useful for various reasons, but from a search engine perspective, you can forget most of them. (And most meta tags really aren’t of much use for any purpose.)

image

Figure 7-3: Examples of meta tags you generally don’t need.

You’ve heard about DESCRIPTION and KEYWORDS meta tags, but also of relevance to search engine optimization — though not always useful — are the REVISIT-AFTER and ROBOTS meta tags:

· REVISIT-AFTER tells search engines how often to reindex the page. Save the electrons; don’t expect search engines to follow your instructions. Search engines re-index pages on their own schedules.

· ROBOTS blocks search engines from indexing pages. (I discuss this topic in detail in “Blocking searchbots,” later in this chapter.) But many Web authors use it to tell search engines to index a page. Here’s an example:

<META NAME="robots" CONTENT="ALL">

This tag is a waste of time. If a search engine finds your page and wants to index it, and hasn’t been blocked from doing so, it will. And if the search engine doesn’t want to index the page, it doesn’t. Telling the search engine to do so doesn’t make a difference.

There are also a few special Google meta tags:

<META NAME="googlebot" CONTENT="nosnippet">

This meta tag tells Google not to place a description under a search result’s link. This tag also has the effect of removing the page from Google’s cache (the saved version of an indexed Web page; you see Cached links by entries in the search-results page).

Here’s another example:

<META NAME="googlebot" CONTENT="noarchive">

This meta tag tells Google not to place a copy of the page into the cache. (The nosnippet tag incorporates the noarchive command.) If you have an anal corporate attorney on staff who doesn’t like the idea of Google storing a copy of your company’s information on its servers, you can tell Google not to. There’s also the NOINDEX tag, which tells Google not to index the page, and the NOFOLLOW tag, which tells it not to follow links from that page.

Including image ALT text

You use the <IMG> tag to insert images into Web pages. This tag can include the ALT= attribute, which means alternative text. ALT text was originally displayed if the browser viewing the page couldn’t display images. ALT text is also used by programs that “speak” the page (for individuals without sight). In many browsers, ALT text also appears in a little pop-up box when you hold your mouse over an image for a few moments.

ALT attributes are also read by search engines. Why? Because these tag “attributes” offer another clue about the content of the Web page, by providing a description of what the image is. How much do ALT tags help? Perhaps not so much these days, because some Web designers have abused the technique by stuffing ALT attributes with tons of keywords. But using ALT attributes can’t hurt (assuming that you don’t stuff them with tons of keywords, but rather simply drop in a few here and there) and may even help push your page up a little in the search engine rankings (and can also help the specific image rank for those keywords in Image Search results).

You can place keywords in your ALT attributes like this:

<IMG SRC="rodent-racing-1.jpg" ALT="Rodent Racing - Ratty Winners of our Latest Rodent Racing Event">

It’s definitely a good idea to use ALT attributes on image links, by the way. Doing so gives Google an idea of what the page referenced by the link is about.

Adding body text

remember You need text in your page. How much? More than a little, but not too much. Maybe 200 to 400 words is a good range. Don’t get hung up on these numbers, though. If you put an article in a page and the article is 1,000 words, that’s fine, and some pages may not have much text at all. But in general, when building a page that you want people to find in the search engines, a number in the 200 to 400 word range is good. That amount of content allows you to really define what the page is about and helps the search engine understand what the page is about.

Keep in mind that a Web site needs content in order to be noticed by search engines. (For more on this topic, see Chapter 11.) If the site doesn’t have much content for the search engine to read, the search engine will have trouble determining what the page is about and may not properly rank it. In effect, the page loses points in the contest for search engine ranking. Certainly, placing keywords in content is not all there is to being ranked in search engines; as you find out in Chapters 16 through 19, for instance, linking to the pages is also very important. But keywords in content are very significant, so search engines have a natural bias toward Web sites with a large amount of content.

This bias toward content could be considered very unfair. After all, your site may be the perfect fit for a particular keyword search, even if you don’t have much content in your site. In fact, inappropriate sites often appear in searches simply because they have a lot of pages, some of which have the right keywords.

Suppose that your rodent-racing Web site is the only site in the world at which you can buy tickets for rodent-racing events. Your site doesn’t provide a lot of content because rodent-racing fans simply want to be able to buy tickets and nothing more. However, because your site has less content than other sites, it is at a disadvantage to sites that have lots of content related to rodent racing, even if these other sites aren’t directly related to the subject. (On the other hand, if rodent-racing fans throughout the world decide that your site is the one on which to buy tickets, and enough of them link to you, you can still rank well regardless of how much page content you have.)

You can’t do much to confront this problem, except to add more content (or create a lot of links)! You can find some ideas on where to get content in Chapter 11 and read all about links in Chapters 16 through 19.

Creating headers: CSS versus <H> tags

Back when the Web began, browsers defined what pages looked like. A designer could say, “I want body text here and a heading there and an address over there,” but the designer had no way to define what the page actually looked like. The browser decided. The browser defined what a header looked like, what body text looked like, and so on. The page might appear one way in one browser and another way in a different browser.

These days, designers have a useful tool available to them: Cascading Style Sheets (CSS). With CSS, designers can define exactly what each element should look like on a page.

Now, here’s the problem. HTML has several tags that define headers: <H1>, <H2>, <H3>, and so on. These headers are useful in search engine optimization because when you put keywords into a heading, you’re saying to a search engine, “These keywords are so important that they appear in my heading text.” Search engines pay more attention to them, weighing them more heavily than keywords in body text.

But many designers have given up on using the <H> tags and rely solely on CSS to make headers look the way they want them to. The plain <H> tags are often rather ugly when displayed in browsers, so designers don’t like to use them. <H> tags also cause spacing issues; for example, an <H1> tag always includes a space above and below the text contained in the tag.

remember However, there’s no reason you can’t use both <H> tags and CSS. You can use style sheets in two basic ways:

· Create a style class and then assign that class to the text you want to format.

· Define the style for a particular HTML tag.

Many designers do the former; they create a style class in the style sheet, as in the following example:

.headtext { font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 16px; font-weight: bold; color: #3D3D3D }

Then they assign the style class to a piece of text, like this:

<DIV CLASS="headtext”>Rodent Racing for the New Millennium!</div>

In this example, the headtext class makes the text appear the way the designer wants the headings to appear. But, as far as search engines are concerned, this is just normal body text.

A better way is to define the <H> tags in the style sheets, as in the following example:

H1 {
font-family: Verdana, Arial, Helvetica, sans-serif;
font-size: 16px;
font-weight: bold;
color: #3D3D3D
}

Now, whenever you add an <H1> tag to your pages, the browser reads the style sheet and knows exactly which font family, size, weight, and color you want. It’s the best of both worlds — you get the appearance you want, and search engines know it’s an <H1> tag. (Note to Web designers who don’t want to listen: Don’t take my word for it. Even Google recommends that you use <H> tags.)

I have heard, in the last year or so, some in the SEO industry stating that H1 tags don’t matter anymore; in one case, because the H1 tags don’t help much, and putting the tags in a larger font seems to help just as much. My response? Again, Google recommends you use H tags, so until I hear someone at Google stating otherwise, that’s the way I’d go. And as you can use CSS to format the H tags to look any way you want … why shouldn’t you use H tags?

Formatting text

You can also tell search engines that a particular word might be significant in several other ways. If the text is in some way different from most of the other text in the page, search engines may assume that it has been set off for some reason, that the Web designer has treated it differently because it is in some way different and more significant than the other words.

tip Here are a few things you can do to set your keywords apart from the other words on the page:

· Make the text bold.

· Make the text italic.

· Use Title Case (uppercase the First Letter in Each Word and lowercase the other letters in the word).

· Put the keywords in bullet lists.

tip For each page, you have a particular keyword phrase in mind; this is the phrase for which you use the preceding techniques.

Another way to emphasize the text is to make a piece of text larger than the surrounding text. (Just make sure that you do this in a way that doesn’t look tacky.) For example, you can use <H> tags for headers but also use slightly larger text at the beginning of a paragraph or for subheaders.

Creating links

Links in your pages serve several purposes:

· They help searchbots find other pages in your site.

· Keywords in links tell search engines about the pages that the links are pointing at.

· Keywords in links also tell search engines about the page containing the links.

You need links into — and out of — your pages. You don’t want dangling or orphan pages — pages with links into them but no links out. All your pages should be part of the navigation structure. It’s also a good idea to have links within the body text, too.

Search engines read link text for not only clues about the page being referred to but also hints about the page containing the link. I’ve seen situations in which links convinced a search engine that the page the links pointed to were relevant for the keywords used in the links, even though the page didn’t contain those words. The classic example was an intentional manipulation of Google, late in 2003, to get it to display George Bush’s bio (www.whitehouse.gov/president/gwbbio.html) when people searched for the term miserable failure. This was done by a small group of people using links in blog pages. Despite the fact that this page contained neither the word miserable nor the word failure, and certainly not the term miserable failure, a few dozen links with the words miserable failure in the link text were enough to trick Google. (After several years, Google put a stop to the miserable failure situation, but the principle still works. I discuss this Googlebomb in Chapter 16.)

So when you’re creating pages, create links on the page to other pages and make sure that other pages within your site link back to the page you’re creating, using the keywords you placed in your <TITLE> tag.

Don’t create simple Click Here links or You’ll Find More Information Here links. These words (also called anchor text) don’t help you. Instead, create links like these:

· For more information, see our rodent-racing scores page.

· Our rodent-racing background page provides you with the information you’re looking for.

· Visit our rat events and mouse events pages for more info.

Links are critical. In Chapters 16 through 19, you find out about another aspect of links: getting links from other sites to point back to yours.

Using other company and product names

warning Here’s a common scenario: Many of your prospective visitors and customers are searching online for other companies’ names or for the names of products produced or sold by other companies. Can you use these names in your pages? Yes, but be careful how you use them.

Many large companies are aware of this practice, and a number of lawsuits have been filed that relate to the use of keywords by companies other than the trademark owners. Here are a few examples:

· A law firm that deals with Internet domain disputes sued Web-design and Web-hosting firms for using its name, Oppedahl & Larson, in their KEYWORDS meta tags. These firms thought that merely having the words in the tags could bring traffic to their sites. The law firm won. Duh! (Didn’t anyone ever tell you not to upset large law firms?)

· Playboy Enterprises sued Web sites that were using the terms playboy and playmates throughout their pages, site names, domain names, and meta tags to successfully boost their positions. Not surprisingly, Playboy won.

· Insituform Technologies, Inc. sued National Envirotech Group after discovering that Envirotech was using its name in its meta tags. Envirotech lost. The judge felt that using the name in the meta tag without having any relevant information in the body of the pages was clearly a strategy for misdirecting people to the Envirotech site.

So, yes, you can get sued. But then again, you can get sued for anything. In some instances, the plaintiff loses. Playboy won against a number of sites, but lost against former playmate Terri Welles. Playboy didn’t want her to use the terms playboy and playmate on her Web site, but she believed she had the right to, as a former Playboy Playmate. A judge agreed with her. The real point of the Terri Welles case is that nobody owns a word, a product name, or a company name. They merely own the right to use it in certain contexts. Thus, Playboy doesn’t own the wordplayboy — you can say “playboy,” and you can use it in print. But Playboy owns the right to use the word in certain contexts and to stop other people from using it in those same contexts.

remember If you use product and company names to mislead or misrepresent, you could be in trouble. But legally speaking, you can use the terms in a valid, nonfraudulent manner. (Again, this doesn’t necessarily protect you from getting sued.) For instance, you can have a product page in which you compare your products to another, named competitor. That’s perfectly legal. No, I’m not a lawyer, but I’m perfectly willing to play one on TV, given the opportunity. And I would bet that you won’t be seeing the courts banning product comparisons on Web sites.

If you have information about competing products and companies on your pages, used in a valid manner, you can also include the keywords in the <TITLE>, DESCRIPTION, and KEYWORDS tags, as Terri Welles did:

<META NAME="keywords" CONTENT=" terri, welles, playmate, playboy, model, models, semi-nudity, naked, censored by editors, censored by editors, censored by editors, censored by editors, censored by editors, censored by editors, censored by editors, censored by editors">

And there’s nuthin’ Playboy can do about it.

What’s ironic is that firms are being sued for putting other companies’ names and brand names in their KEYWORDS tags, when it has little or no influence on search engine rank these days.

Creating navigation structures that search engines can read

Your navigation structure needs to be visible to search engines. As I explain earlier in the section “Seeing Through a Search Engine’s Eyes,” some page components may be invisible to search engines. For instance, a navigation structure created with JavaScript may not be deciphered by some search engines. If the only way to navigate your Web site is with the JavaScript navigation, you could have a problem; some search engines may not be able to find their way around your site. (As discussed in Chapter 3, Google’s pretty good at reading JavaScript these days; some other search engines may not be.)

tip Here are a few tips for search engine–friendly navigation:

· If you use JavaScript navigation or another technique that could be invisible (which is covered in more detail in Chapter 8), make sure that you have a plain HTML navigation system, too, such as basic text links at the bottom of your pages.

· Even if your navigation structure is visible to the search engines, you may want to have these bottom-of-page links as well. They’re convenient for site visitors and provide another chance for the search engines to find your other pages, provide more keywords on the page, and provide more keyworded links to other pages.

Yet another reason for bottom-of-page, basic text navigation: If you have some kind of image-button navigation, you don’t have any keywords in the navigation for the search engines to read.

· Add a sitemap page and link to it from your main navigation. It provides another way for search engines to find all your pages.

· Whenever possible, provide keywords in text links as part of the navigation structure.

Blocking searchbots

You may want to block particular pages, or even entire areas of your Web site, from being indexed. Here are a few examples of pages or areas you may want to block:

· Pages that are under construction

· Areas of the site that you don’t really want indexed by the search engines

· Pages with information intended for internal use (you should probably password-protect that area of the site, too, of course)

In the past, many Webmasters have also blocked directories in which they stored scripts and CSS style sheets. However, Google recommends that you do not do this, as Google actually does read and interpret scripts and style sheets these days.

remember Using the ROBOTS meta tag or the robots.txt file, you can tell search engines to stay away. The meta tag looks like this:

<META NAME="robots" CONTENT="noindex, nofollow">

This tag does two things: noindex means “Don’t index this page” and nofollow means “Don’t follow the links from this page.”

To block entire directories on your Web site, create a text file called robots.txt and place it in your site’s root directory — which is the same directory as your home page. When a search engine looks at a site, it generally requests the robots.txt file first; that is, it requestshttp://www.domainame.com/robots.txt.

The robots.txt file allows you to block specific search engines and allow others, but Webmasters rarely do so. In the file, you specify which search engine (user agent) you want to block and from which directories or files. Here’s how:

User-agent: *
Disallow: /includes/
Disallow: /scripts/
Disallow: /info/scripts/
Disallow: /staff.html

Because User-agent is set to *, all searchbots are blocked from www.domainname.com/includes/, www.domainname.com/scripts/, www.domainname.com/info/scripts/ directories, and the www.domainname.com/staff.html file. (If you know the name of a particular searchbot that you want to block, replace the asterisk with that name.)

warning Be careful with your robots.txt file. If you make incomplete changes and end up with the following code, you’ve just blocked all search engines from your entire site:

User-agent: *
Disallow: /

In fact, this technique is sometimes used nefariously; I know of one case in which someone hacked into a site and placed the Disallow: / command into the robots.txt file — and Google dropped the site from its index!

Google now has a nice little robots.txt test tool in its Webmasters Console; see Chapter 13 for more information.