Latest Articles

How can I optimize my website, it’s dynamic? The Rewrite Engine! What?

By Webposition SEO Team

By Curtis Friedl

Many people will ask what you are talking about when you refer to a Rewrite Engine. This may be one of those tightly held secrets, which are spoken about only in hushed whispers in the corner of a darkened room.

Today nearly everyone uses a URL several times each day; we share them by email, Instant Messaging, or simply recall them from memory. Many of the dynamic sites of today have URLs that resemble this one: http://www.your-web-site.com/catalog.php?cat=category&product_name&color=product_id. The larger and more complicated a site becomes, the greater the number of variables the URL may contain.

Google makes a statement on one of their webmaster support pages that they will support this dynamic site, but only to a point:

http://www.google.com/support/pages/bin/answer.py?answer=470&topic=8490

“Google does index dynamically generated webpages, including .asp pages, .php pages, and pages with question marks in their URLs. However, these pages can cause problems for our crawler and may be ignored”.

Google goes on to say that “you might consider creating and submitting a detailed Sitemap of your pages using Google Sitemaps.”

The recommendation of a site map is extremely important, and it is one that every website owner needs to ensure they follow. Providing the search engines with a quick and easy way to locate all of your pages should be common sense. However this does not provide the complete picture of creating a URL that is easier to use and one that is short and descriptive. The URL should have information describing the content (not just a list of variables), and it should not be too long.

A rewrite engine like Apache’s mod_rewrite, will accomplish several factors that will be beneficial to a site. First off, this makes website URLs more user friendly so they will be easier to understand and remember when shared with a friend or customer. Additionally this will prevent undesired linking, or stealing of content, and will not expose the inner workings of a site to users. More importantly for SEOs, these new urls are more search-engine friendly. As Google and other engines have trouble with websites that utilize complex variables as discussed above, every step that can be taken to ensure a search engine will be able to quickly locate, and index your content is important. Websites which are easier to index are more likely to have their content included in search results.

A URL rewrite engine can be presented with URLs in one form (one with several variables like the one above) and display information in another form (www.your-web-site.com/subfolder/something-ABC.html). A rewrite engine allows URLs to be cleaned up and made friendly by creating a set of rewrite rules, rather than modifying the configuration of the web server software.

A good example of the setup of a rewrite can be seen in this discussion at Search engine watch http://forums.searchenginewatch.com/showthread.php?t=3925

Should I Submit Again?

By Webposition SEO Team

By Scott Goodyear

Have you launched a new web site for yourself or a client, submitted to the search engines, waited a few months, and were left wondering if the site had gotten into the search engines? Maybe you’ve found the URL you’ve submitted but don’t know if they’ve only picked up the URL, or if they had picked up the content from your pages? You may also be wondering from time to time whether you should resubmit or not? In this article, we will explore some of the methods that you might use to answer these questions.

Extended The Invitation To Visit.

When working on the content of a site, gaining inbound links, searching for link partners, etc., you may have submitted a few pages to the major search engines and received a vague notice that your site has been added to a list of sites to potentially crawl and index. Here are a few examples of messages you may have seen:

google_submit_verification.gif

yahoo_submit_verification.gif

A few months will likely pass and at that point, it is usual that the average site’s submitted pages will not be listed in the top100 let alone top 1000 rankings. Should you resubmit? The answer to this question may lay in answering this: was the site’s home page, or at least a few pages within the site even indexed?

Try searching with a few URLs as ‘keywords’. For example: www.marketposition.com/blog/archives/2004/12/index.html on Netscape.com This can be an important step since you may not find the URL under a specific keyword or phrase, but you will want to at least know if the engine is aware of the site. This can be a bit time consuming since you might need to check for URLs with and without the “www.” in them. You may wish to consider using some of the advanced search operators like:

site:www.site.com
inurl:www.site.com
domain:www.site.com

The actual search operators can vary based on the engine you are using, but most sites will offer an advanced search option or domain search option if you do not wish to bother with search operators. There are also engines like Yahoo! that have an exploration tool to simplify this process.

Did An Engine Just Knock At The Door?

If you have a web analytics package or know how to review your site’s log files, you can see whether a search engine robot has been visiting your site. Many search engines robots have obvious names like Google Bot, MSNbot, etc. or unique names like Slurp, rather than the simple IP address/browser name combinations that most of your human visitors will have. If you are taking part in a search engine’s paid advertising program, this can mean that your site will be indexed not only by a normal search spider but could also be analyzed by an advertising program spider. While advertising and submission services will not typically cause your site to gain an extra rankings boost, it can mean that your site will be indexed faster than letting the spider naturally crawl your site and incidentally, may mean that your site information is updated more often than a non-advertiser.

I Can’t Believe They Still Have THAT In The Index!

Some time has passed and you’ve likely had a few pages that have been indexed. From time to time, engines will return to your site to check for updates automatically. When others link to your site, the engines can also follow those links back to your site and re index your site as well. This is why larger, more popular sites like CNN.com, BBC.co.uk, etc. are often quite up to date in the engines. Yet when looking at listings from your own site, you may find information that is out of date or different from the content that is currently on the page.

If your indexed content is out of date, first check your analytics package/log files as mentioned previously. Has the engine been by in the past few months? If so, it may simply take a bit longer for the engine to update its database and spread the updates throughout its various data centers. If you do not have this type of option available to you, some engines will allow you to check their ‘cache’ directly from their search. Here is a search in MSN for the URL:
http://www.marketposition.com/blog/archives/2005/09/controlling_sea.html

msn_url_cache.gif

Search, then click on the ‘Cached page’ option.

msn_url_cache_detail.gif

On the next page, at the top there is a note about when MSN had last examined this page. Since they have visited recently, I would not resubmit this even if the contents of the page had changed on say 4/7/06. The engine’s robot should come back to visit the page in the future whether I resubmit or not. If the date is quite old, such as 3-6+ months out of date, you may wish to resubmit if the page has updated information. If the content on the page is the same as the cached copy, there is no reason to resubmit and resubmission at that point will not help to improve your rankings.

How ‘engine friendly’ is your site? Some sites that rely heavily on flash, javascript, or other technologies are harder to interpret and index than those based on plain HTML/text. Search engines can choose to display listings from a directory like The Open Directory Project (DMOZ.org), if a search engine has a problem reading your page. You can try to get your DMOZ listing updated however it might be better to fix your page so that it is more search friendly. The text based Lynx web browser is ideal for displaying a very basic text version of your page, similar in many ways to what an engine might see. If you do not have a DMOZ listing you may simply end up with an entry for your URL, but no description. This is sometimes referred to as a ‘partial listing‘.

Many engines have multiple data centers. While the engines often update their data centers on a continuous basis, from time to time you could notice that a page seems to be partially indexed, such as a URL with no text description. If you perform a unique text string search, you may find that the page’s content has indeed been indexed, but perhaps the information has not yet been linked to your normal listing due to a canonical issue (i.e. your site was indexed twice, once under www.site.com and once under site.com where one or the other shows up with the text listing). If you do find that one or the other is listed with text and the other is not, you may need to work on creating a 301 redirect rule for your site.

So how do you perform a URL verification search? Again, I will use this page: http://www.marketposition.com/blog/archives/2005/09/controlling_sea.html
I can look for unique text that other sites would not typically have. I could probably use the title of the article, a combination of the author name and a keyword from the page, or other text information that is fairly unique like:

‘Controlling Search Engine Robots With robots.txt and Other’ Google Yahoo MSN
‘Scott Goodyear Robots.txt’ Google Yahoo MSN
‘conserve bandwidth (data transfer) since some engines will completely skip files and folders indicated’ Google Yahoo MSN
(Some words like and, or, the, .com, and others are known as noise words or stop words which most engines will ignore because they are so common.)

As an aside, if you have a list of several URLs that you use to periodically verify unique text, you can check these URLs from time to time (once you have first verified that the URL is pulled up with your unique text) and submit to the engines if your URL no longer shows up with the unique text. At that point you will know that it is a good time to resubmit. If you are using Web Position 3 or the newest Web Position 4 software, a URL verification feature is built into the Reporter and Submitter modules. Simply create your Reporter mission as normal, follow the URL verification steps from just above, and if the URL is found, enter this information into the Reporter mission. Save and close this mission. Create a Submitter mission and check the URL verification option. Save and close this mission. Start a new Scheduler event, set it to run say once a month with these two new missions and WebPosition will run once a month to make sure that your pages are still indexed and if not, resubmit them for you.

How URLs Can Affect Top Search Engine Rankings

By Webposition SEO Team

Strategies you need to know about…
How URLs Can Affect Top Search Engine Rankings
…and everything else that’s nice to know about them too! – By John Heard

Courtesy of SearchEngineNews.com | February 2006 Planet Ocean Communications, the top-rated source of search engine marketing information.

You’ve seen it a million times; you even know it by name – URL. You know that URL stands for Uniform Resource Locator and you probably refer to it by its 3-letter acronym: “U-R-L.” Or maybe you’re one of the cool kids who calls it an “Earl.” Either way, you may not know how URLs can affect your search engine optimization (SEO) strategies. Well, move over cool kids, ’cause you’re about to learn something new…

Let’s begin with the basics so that later, when we drill down into the important need-to-know details pertaining to SEO strategy, you’ll be perched on a solid knowledge-base and primed to follow through when it comes time to implement what you’ve learned.


The nice-to-know stuff…

A typical URL, like the one seen below, can be broken down into the following individual components:

http://www.domain.com/subdirectory/filename.html 

* The http stands for Hyper Text Transport Protocol, which defines the method to be used to view the resource. Basically, that’s what tells us a webpage is reachable via a web-browser and that search engines can index it. It also defines which Communication Port to use – in this case the default for http is Port 80. Different protocols typically use different ports.

* WWW is the section of a domain name commonly referred to as the subdomain. Most websites use either www or else no subdomain at all. However, many large composite websites make use of unique subdomain names to differentiate between different major sections, services, or topics within their sites. For example news.google.com or blog.company.com.

Subdomains can have multiple levels, for example new.pressreleases.company.com or, if you watch some of your emails where people are trying to trick you into giving out your private account details (i.e., phishing schemes), you might see something like www.citicorp.com.domain.somewhereelse.ru.

Subdomains can have many levels and, technically speaking, there are few restrictions to their length or number. Subdomains can and typically do translate to a separate IP address than the primary domain name. For non-SEO applications, a unique subdomain often is a different webserver but, nevertheless, it remains under the control of the same primary domain owner.

For example, only Google can make use of uniquename.google.com because the DNS (Domain Name Server) addressing is tied to their primary domain, google.com, which is solely under Google’s control.

Subdomain names are not case-sensitive. In other words, News.Google.com is the same as news.google.com.

* domain refers to the domain name section of the URL. It, combined with the .com, is translated by the Domain Name System into a numeric IP address which is used to connect to the actual webserver that hosts the resource. A domain name is also known as the hostname which refers to the name of the computer where it is hosted. A domain name must be composed of letters, digits or hyphens and are limited in number to 63 characters in length. Letters A-Z, numbers 0-9 and hyphens are the only legal characters allowed in a domain name.

Domain names are not case-sensitive – MyDomain.com is the same as mydomain.com.

* The last part of the domain name, .com, refers to the Top Level Domain or TLD. Like subdomains and domain names, the TLD is not case sensitive. TLDs are classified by three types:

* Generic: .com, .org, .gov, .edu, .biz, etc.
* Country Codes: .us, .uk, .jp
* Infrastructure: .arpa (the only one)

In this report we need only be concerned with Generic and Country Code TLD’s for SEO purposes.

* Finally, let’s define the elements beyond the TLD that complete the breakdown of our sample URL.

* subdirectory – Refers to a subdirectory or what appears to be a subdirectory on the webserver.
* filename – Refers to the document file on the webserver.
* .html – the filename extension that typically tells us what type of document. In this case, .html tells us the document is written in Hypertext Markup Language; in other words, it’s a typical webpage.

Everything in the URL beyond the TLD – i.e., subdirectory, filename, and extensionsare all case-sensitive. In other words, index.html is not the same as Index.Html


The need-to-know stuff…

Now, let’s examine how each of these elements should be integrated into your search engine optimization and marketing strategies.

Subdomains

It’s common knowledge that the root of SEO is keywords. Since the beginning, all strategies have revolved around the ever-changing and constantly evolving targets for keyword placement. In the swirling sea of anchor links, meta tags, headlines, body copy, and what-not, one of the most consistently useful placements in terms of SEO has been within the URL itself. And the degree to which this strategy remains effective is simply a matter of how far the engines scale-it-up on the algorithmic dial.

This is why the use of keyword-specific subdomains has been a long standing and typically effective SEO strategy. When viewed from the search engine’s point of view (SEPOV), this also makes sense. That’s because when a keyword is found within the URL there is typically a very high probability that the keyword is relevant to the webpage’s content – and search engines hunger to provide relevant results.

So, keyword placement in the subdomain accomplishes two things:

1. It provides a clue to a potential site visitor what the site is about prior to clicking the link in the search results (as would be the case with a domain like CheapAirlineTickets.mysite.com).

2. It gives the search engines an additional relevancy indicator which they can choose to use – a little, a lot, or not at all – within their overall relevancy algorithm. Over the years, we have found that search engines cannot resist using it to varying degrees – even if only a little.

Think about it like this: If all else is equal (which is only hypothetically possible), the site with the keyword in the subdomain will likely rank higher than an otherwise equally optimized site without it. And, perhaps more importantly, given a choice between two otherwise equally attractive selections within the search results, the average potential site visitor will choose the link containing the keyword in the subdomain over the link that does not.

Utilizing a subdomain is one of the best legitimate strategies for placing your primary keyword into the URL when the specific keyword – TLD name is already taken or otherwise unavailable.

Currently, Google, Yahoo, and MSN each appear to be giving some ranking boost to pages that contain the keywords in the subdomain of the URL. We believe that Yahoo places the most significance on keywords in the domain name, closely followed by Google. MSN doesn’t seem to place quite so much emphasis on this, but it does appear to factor it into their scoring.

We can’t help but notice that most of the top pages in the search results for very competitive searches contain the keyword either in the domain or the subdomain name. One good example is a search for music. On each of the big three engines you’ll find many of the top results have the keyword music either in the subdomain, primary domain, or in some cases, the subdirectory – or else you’ll find a keyword (like mp3 or mtv or itunes) that is synonymous with music.

Although there isn’t any question that keywords in the subdomain can help rankings, some cautions are in order. Taking into account that subdomains can be separated by a period – making possible certain combinations like keyword.keyword.domain.com – we recommend sticking to the more customary single level format; keyword.domain.com. Rarely do you see multilevel subdomains ranking for the more difficult generic searches.

Another possibility is to delimit your subdomain using dashes. For example keyword-keyword.domain.com is also a technically available possibility. However we don’t recommend using more than one dash in a subdomain even though we have seen some good results at MSN using more than that. Regardless, unless you’re optimizing solely for MSN (unlikely) you should limit subdomain dashes to one, or none.

A keyword placed in a subdomain is not only a ranking factor, but also a linking factor. Frequently, when another site links to your site, they use the domain name as the anchor text of the link. Placing your best keyword in the subdomain means that other sites will be compelled to use that keyword in their link to you. That will score your page some extra relevancy points from the search engines point of view.

Another caution to be aware of is wildcard subdomains where anything.domain.com results in the same page as anythingelse.domain.com. Search engines have a VERY dim view of this practice and we don’t advise using it.

Here’s another warning: Don’t create a subdomain when there isn’t a good user focused reason to do so. Avoid having subdomains with only one or two pages on them – a small number of pages on a subdomain (other than www.) is a red flag to a search engine. You can expect that ranking penalties or outright bans could be levied on any site that combines the typical www.domain.com format with a bunch of low-populated keyword-laced subdomains in an obvious effort to manipulate rankings.

If in doubt, it’s always a good idea to review how some of the major search engines are structuring their file systems. For example, look at Google. Most of their site is at www.google.com. However they assign a subdomain to certain specific and large areas of their site. For example news.google.com, groups.google.com, froogle.google.com, and local.google.com.

Each of these subdomains are logical separations and, clearly, they have good reason for dividing their site into these subdomains. A contrasting example would be a shoe site that uses nike.sitename.com, adidas.sitename.com and so forth. We’re not saying this wouldn’t be effective, only that it’s pushing the envelope and the next algorithm tweak might land the whole site in the penalty box. If you do it, be sure to place plenty of unique content within each of the subdomains. Otherwise it’s sure to be viewed as a spam technique – so beware.

Domains

Your domain, of course, is your registered domain name. For example, this site’s domain name is SearchEngineNews. Naturally you don’t have the flexibility of modifying it in the way that you can with subdomains. Regardless, the ranking advantages enjoyed by keyword-smart subdomains also apply to domains.

Clearly, it helps considerably to have good keywords in your primary domain – both from a search engine and a consumer perspective. The ranking boost is most profound when the domain name exactly matches the keywords being searched on. However, due to past abuses, there are some unwritten restrictions and warnings to be aware of.

At one time, keyword-keyword-keyword-keyword.com had a boosting effect on rankings, but that didn’t last long. The engines were quick to realized they were being gamed and began counting hyphens and domain name character lengths. As you might expect, they were able to correlate multiple dashes and long domain names with spam sites. It didn’t take them long to restrict how high a long, multi-dashed domain name could rank.

Today we recommend no more than one dash, and the shorter your domain name, the better. In a perfect scenario, your best domain name is typically your primary keyword or keyword combination. Whenever that isn’t possible, at least try to get the most important keyword mentioned somewhere in your domain name. Otherwise, expect to settle for placing it into your subdomain.

To hyphen or not? A few years back, hyphenated keywords within some domain names enjoyed a small ranking advantage. Hyphens were used to delimit the text so the search engines could more easily distinguish each keyword within a phrase without mistaking it for some unique term. However, today we’re seeing more indications that the major engines are getting better at picking keywords out of a phrase without the help of dashes – at least in the English language.

So today, whenever faced with a choice, you should favor the keywordkeyword.com over the keyword-keyword.com URL. While it’s true you may want to secure both URLs to keep the other one out of the hands of your competitor, you would be better advised to develop the URL without the hyphen as your primary site.

Subdirectories

If you can’t get the keyword into the domain name and it isn’t advisable to put it into the subdomain, then your next best option is to place it in the subdirectory name.

Bear in mind that it really doesn’t matter if you use a subdirectory or a file name to contain the keyword, but typically one (but not both) will help search ranking. In most cases, /keyword/index.html, ranks equally with /keyword.html.

Regardless, you will likely have better results if part of the file structure contains at least some part of your keyword phrase. Subdirectory names are also sometimes important for other reasons. For example, take /cgi-bin/. Historically that’s been a subdirectory avoided by search engine robots for fear of getting trapped in an infinite loop and indexing millions of unique URLs that are actually just duplicate pages.

For the most part, however, search spiders have solved this problem and will index URLs that contain cgi-bin. Still, we suspect some limitations still remain, so, ideally, you should avoid using that specific subdirectory name whenever possible.

Other similar directory names that may have built-in limitations are popular software program names like /phpBB/ for the PHP Bulletin Board software and /Gallery2/. If you install one of these programs we would recommend that you use a unique directory name whenever possible just to ensure you’ll avoid whatever limitations to indexing that might still be lingering.

File names

Just as with subdirectories, having the keyword as the file name is usually a good idea. However, be forewarned that you don’t want to overdo any part of this. One mention of the keyword in the URL is often good enough. Multiple repetitions in the URL are typically associated with spam and ranked lower.

For example http://www.domain.com/keyword.html is ok, but http://www.keyword.com/samekeyword.html is probably overkill. Examine the search results for your keywords and look at what is scoring at or near the top. You’ll see the pattern quite clearly as to what is good and what isn’t.


File Extensions

This is a frequent question that pops up in our tech Q&A’s – Does the file extension affect my search ranking? Typically NO, it does not, at least as long as the extension is one that is commonly associated with a web page.

We have not seen a case where this mattered at all in the last couple years. Of course, .html or .htm is the most often used file extension. But more and more we are seeing file extensions in the top 10 search results that include .cfm, .php, .asp, and .aspx. Ranking-wise, all of these file extensions are equal in the eyes of the engines.

More advanced-need-to-know stuff…

Here are some important points to help you close the technical loopholes on your SEO marketing strategies…

Fine tune your URL structure

In general, URL structure for SEO optimization follows a general rule that…

the more generic your keyword, the earlier you want it in your URL structure.

For instance, if you want to score for the extremely generic term music that returns more than a billion search results at Google, you should definitely place it in either your domain or subdomain name.

However, if you want to score for a specific model number for a cell phone your keyword wouldn’t be anywhere near as popular as the keyword music. So, using your keyword – the model number – as a subdirectory or file name will typically work quite well.

Also, you should be wary of over-doing it. A good rule of thumb is this:

If the URL looks like spam, it probably will be treated as spam.The search engines caught on a long time ago to the www.viagra-pills-porn-casino.com/ style domain names and such similar file structures. Today you want to use domain and file name structures that appear to be common sense to the human visitors of your site. Always bear in mind that people do look at the file names within the search results. And seeing the keywords highlighted in the URL does increase click-through-rates.

Static vs Dynamic URLs

A static URL looks like this:
http://www.domain.com/pagename.html

A Dynamic URL is typically characterized by having certain characters like ? after the file extension. For example:

http://www.domain.com/subdirectory/filename.php?variable=
xyz&variable=123&2ndvariable=456&3rdvariable=789

Everything after the ? mark is typically a variable.

Remember, many dynamic systems do not need all of their variable information in order to work correctly. Many programmers go overboard on the variables they add to the URL structure.

Today’s major search engines have added capabilities for crawling dynamic URLs that weren’t available a few years ago. Still, there are limitations. A site rarely gets fully indexed when there are more than three variables in a dynamic URL. Ideally you should avoid using more than two.

There are also some specific variable names – like ID – that indexing-bots often avoid. That’s due to what are called session variables – variable that is unique to each visitor to the site. The use of session variables often results in many, many, duplicate URLs being indexed in the search engines. That’s why the bots do their best to avoid URLs that appear to have a session variable.

If you are operating a dynamic site you should, ideally, avoid using session variables whenever there is an alternative. And, if not, find a way (like using robots.txt) to prevent the search engines from crawling those URL paths.

The most effective way to prevent indexing problems with session variables is to use a good IP delivery program that will recognize search engine spiders and make sure they only receive URLs which have had the session variables removed.

Note: a large number of URLs with session variable have begun appearing in Google recently. This appears to be a glitch in Google’s indexing process that should be resolved soon, as Google states on their webmaster guidelines page that they do not index pages with session variables.

It’s actually a major headache for many webmasters, since it means that many web pages are getting indexed multiple times under different URLs. All the more reason to employ a solution which prevents search engines from being served session variables.

Absolute Vs. Relative URLs

Another question that often comes up about URLs is how to refer to them in the HREF section of a link. There are two options here: absolute or relative. An absolute link means you use the entire URL:

http://www.domain.com/filename.html

…where a relative link simply refers to:

filename.html

While either link will work just fine when referring to pages on the same domain, absolute URLs are preferred. They’re better for a few minor reasons, like…

* The links will work if someone steals your content or saves it to their desktop.

* Absolute URLs help avoid getting a dumb-bot stuck in a loop that increases server load and generates 404 errors.

Admittedly, it’s a minor point. But, if you’re striving toward SEO perfection, absolute links are part of the perfect package. Anything you can do to make it easier for search engines to index your pages is a good thing.

An Extremely Important Point

Keywords in URLs can help rankings, but remember that changing your existing URLs without redirecting it to a new one will break all the links and bookmarks pointing towards that URL, causing the page to drop out of the search engines.

This means that if you have an existing high-ranking page, it’s almost never a good idea to change the URL. If you must change it, be sure to use a 301 redirect to make sure links, user traffic, and search engines are properly sent to the new URL. For a complete tutorial on using the 301 redirect to change URLs, see our report:

Unraveling the Versatile 301 Redirect

Silver Bullet?

The continuously evolving arms race of search engine optimization and marketing is the science of piecing together all of the useful components in ways that can only help, and never hurt your ranking and consumer marketing efforts. Intelligent placement of keywords within URLs is an integral part of SEO strategy. And, when done correctly, it’s likely to pay ranking dividends far into the foreseeable future.

Whether the search engines like it or not, there is no escaping the fact that keywords in the URL assist them in their mission to provide relevant results. Therefore, by utilizing this strategy, and keeping it looking natural, you’re capitalizing on the one single gimme that search engines can never be expected to completely eliminate.

However, one should also bear in mind that, as a stand-alone strategy, putting the keyword in your URL won’t matter much – it isn’t the silver bullet for high rankings. But, when all else is equal, it is a fact that the webpage with the keyword in the URL will outrank and receive more click-throughs than a page without it!

Because success depends on getting all the little things right within a concerted effort,

John Heard – Technical Analyst
Planet Ocean Communications

How to Research and Utilize Valuable Search Keywords

By Webposition SEO Team

by Richard Drawhorn

Every search engine marketing professional understands the importance of choosing relevant keywords to include in web page content. Many factors must be considered during the keyword selection process and, once a list has been compiled, the keywords must be incorporated into the site content properly. In this article, I’ll outline a general approach to selecting valuable keywords, and also suggest a few things to consider when optimizing your web site content.

At the beginning of any project, it’s always best to define the goals. In the case of keyword research, the goals are to attract qualified traffic to the web site and (in the case of ecommerce sites) to convert visitors into paying customers. Notice that the goal is twofold: the first part involves pleasing search engines, and the second part involves pleasing people. Successful search engine marketers perform well on both of these fronts.

Compiling an Initial Keyword List

The first step in creating a good keyword list for a web site is to assemble a rough list that can then be refined through a selection process. The initial list can be created using a variety of simple techniques:

  • Evaluate your competitors’ web sites to get an idea of the kinds of keywords they have targeted. This can be done easily by looking at the source code of your competitor’s home page and studying the keywords contained within the Meta Keyword tag. If you have a copy of WebPosition software, the Meta keywords can be conveniently extracted from the source code of any web site.
  • Based on your own knowledge of the web site topic, add any keywords you think might be relevant. Imagine that you are somebody using the internet to find information, and add the keywords you might type into a search engine.
  • Use a keyword research tool like WordTracker to generate related keywords and phrases based upon an initial keyword search. Try several different keywords as a starting point, and you should find a broad list of related keywords.  
  • Create new keyword phrases by combining keywords in the list into combinations that seem to make sense, and might plausibly be typed into a search engine. A spreadsheet program like Microsoft Excel is useful to help create and organize these various keyword combinations.
  • Search the major search engine like Yahoo!, Google, or MSN for the top keywords on the list to find web sites that are already ranking well for those keywords. Study these sites and look for new ideas for keywords as well as potential content for your own site.

At this point, the list is probably much larger than expected, and you are wondering which of these keywords are most likely to drive relevant traffic to your site. It’s time to refine the list by weeding out keywords that may attract the wrong types of visitors, or not generate much traffic at all.

Refining the Keyword List

Now that the initial list has been compiled, some additional research needs to be done. Using a keyword research tool like WordTracker, determine how many times each keyword has been typed into search engines. Keep in mind that these numbers indicate how popular keywords are, but the most popular words are not necessarily the best. First, remove any keywords from the list that have very low popularity numbers. Evaluate each of the more popular keywords by taking the following points into consideration:

  • How much competition is there for the keyword?
    If the keyword in question has a million other web sites competing for the top position, it will be difficult to achieve good results. Often words that are very popular may be far too general to be of value. For example, if you are a real estate agent, you would not want to optimize for a general phrase like “real estate”, since there are so many competing sites and the phrase is so general it could refer to real estate anywhere in the world. You would be much better off using phrases that limit the search to the area you work in (”Oregon real estate” or “Portland Oregon real estate”). WordTracker will display the number of competing pages for each keyword, and also calculate a useful parameter called the Keyword Effectiveness Index (KEI) which takes into account both the popularity and the competition for the keyword. Use this information to locate keywords that are popular, but do not have an overwhelming amount of competition. 

     

  • Does the keyword address where the customer is in the buying process?
    The most relevant keywords are those that have a high liklihood of converting a visitor to a paying customer. Often, the keywords typed into a search engine reflect whether the searcher is ready to make a purchase, or whether they are simply looking for information about a topic. Be sure to include keywords in your list that will capture these potential customers. For example, somebody who types “digital camera prices” into a search engine is probably closer to making a purchase than the person who simply types “digital cameras”. You will find many variations of keywords related to your web site topic that provide clues about where the searcher is in the buying process. 

     

  • Use a web analytics tool like WebTrends to gather additional information.
    If you have a web analytics tool available for your web site, then use the data it has gathered for referring URLs. The referring URLs will let you know what keywords people have actually typed into search engines to find your site in the past. Try to locate any patterns in this data, such as keywords that may relate to a current event for example. If your web site has a built in search feature that allows users to search your own web site for content, then analyze that information as well. The keywords that your own visitors type into your local site search engine are often valuable keywords that can be used within your web site content to attract visitors to your site.

 

Now that the keyword list has been refined to contain only the best keywords based on your research, it’s time to integrate these keywords into your content. This is where your knowledge of search engine optimization (SEO) and marketing will be put to use. First, use your SEO skills to place the keywords in the appropriate sections of the page. These include the Title tag, Meta elements, Header, Body, Links, and Alt text in your image tags. A tool such as WebPosition’s Page Critic is useful at this stage. It provides information about the keyword frequency, prominence and other factors typically found in other pages already ranking well in the search engines. By emulating the keyword statistics of high ranking pages, you are improving the liklihood that your page will also be ranked well on search engines.

Create Compelling Copy for your Web Pages

Knowing where to place keywords and at what frequency is only half the battle. These considerations are of course important to achieve high rankings in search engines, but it’s equally important to remember that your web site is there for the benefit of human beings. Good marketers realize that the style of language chosen often strongly affects a potential customer’s course of action. If you write good copy that appeals to your customers, then your web site visitors are more likely to become customers.

If you find that you’re having trouble using the keywords in the right context, then try creating new content that fits the keywords. Create resources that may not apply directly to the sale you are trying to make, but a potential customer might still find useful in some way. For example, if your online business sells gardening tools and supplies online, then you might consider publishing informational articles related to gardening. This kind of valuable content can attract visitors to your web site initially, and appeal to their positive feelings about gardening in general. If the visitor finds something useful on your web site, and in general has a positive experience navigating through the site, then its more likely that they will convert to a customer.

Conclusion

Conducting proper keyword research is a critical component of effective search engine marketing. Before spending time optimizing your web site content, be sure to do the necessary research to identify a list of keywords most likely to drive relevant traffic to your web site.