in SEO by Jeffrey_Smith

Online, there is always someone searching for something; learning how to position your content, site architecture and landing pages in search engines is the premise of SEO.

seo-and-ir

Search Engine Optimization is built on the foundation of information architecture and information retrieval.  As an SEO practitioner, you can gain immense insight for optimization from speaking their language, by lifting the hood to find out what makes search engines tick under the surface. This implies mastering the metrics responsible for creating and fine-tuning relevance; since those metrics are algorithmically responsible  for delivering all relevant natural search results.

Information retrieval systems utilize a combination of web crawlers; index generators; and query servers to collect data which is then parsed, structured and archived [1].

When a query is executed against the stored data, a facet of inverse document frequency uses an [is/is not] shingle level analysis (segmenting data like a game of concentration to find exact and broad match portions of keyword/query data).

In an overly simplified summary, IR (information retrieval) works by breaking data/information into segments and then matching the query with the fragment of data or document most suited as a direct-hit or correlation to the query as a contingent of relevance and relevance score. There are multiple weighting schemas utilized, some of the more popular are the vector space model and term discrimination to parse, assign and sort documents for relevance.

Search engines often cluster probabilistic thresholds for keywords when a suitable amount of topical content is found within a global profile on a website. Typical aspects are known as nearest neighbor queries (NN) or probabilistic nearest neighbor (PNN) level algorithms [2].

In other words, if enough content/nodes exists, and overlap with multiple related objects (your content), when one node/keyword/mention acquires authority (i.e. 75-100 occurrences, pages, internal links, inbound links or citations), relevance from one node to another will spill over into other topical areas and rankings/from retrieval from the index can emerge that cross-reference the gap between the two or more keywords, documents, etc. i.e. the process of stemming.

In other words, as you add more topical information of a subject, the threshold of relevant data assigns a profile to your pages and your website for containing a specific degree of data on the subject or topic.

Herein is the basis of adding topical content to reinforce continuity on a subject as well as create a broader array of destinations for query space inquiries. For example, if you search for “purple snafenhalgers” and everything that is not purple or contains any of the key shingles comprising the cluster re-cog-nized  as “snafenhalgers”, is instantly eliminated; particularly, if no reference or citations exist to solidify the existence of the sought data to match against the index.

Search engines are programmed to retrieve the most relevant document and based upon the on page and off page SEO factors your pages exhibit. Search engine optimization uses components such as coherent title tags (that summarize the contents of that page), h1/heading tags, multiple occurrences of singular and/or plural synonyms (containing a shared subject matter) and the internal and external link graph both to and from that page; referenced within a site or from other websites determines the degree of authority a page or website has for a said occurrence/term.

It is an aspect of global term weights, the relevance model /weighting scheme and page level thresholds, but inverse document frequency otherwise known as IDF [3] parses both page level and site level occurrences of keyword occurrences when looking for an ideal match.

For example, if you have multiple pages on “search engine optimization” and multiple pages on “marketing or marketing tips”, depending on the correlations used and the query used (such as “search engine marketing tips”), your page could be returned as a direct hit.

Internal links and virtual theming are an effective method to assign a specific pecking order for specific keyword queries. Aside from that, link and domain popularity, click through data and the volume of content or inbound links to each page can shift search engine ranking factors in your favor.

For instance 300 pages on a topic, all internally linked to a landing page, as well as deep links to each of those pages are enough to produce tangible shifts in rankings. On the contrary 20 pages could be enough (depending on the query, industry size, how many quality links are required to cross the tipping point).

Considering that only the top 1000 results matter, there are always websites with a clear advantage; based on when they started optimizing their content, and how much time they have had to cultivate their collection of documents and popularity on the subject.

Google organic search results rewarded topical relevance by adding a plus sign accompanied by a double ranking or third ranking in the top 10 results stating “show more results from” with the search result (thus indicating a high concentration of the said terms).

At the end of the day, in order to dominate the SERPs (search engine result pages), you need relevance and popularity. Despite the obvious, built more content and links takeaway, there are other more significant undertones to this article.

To summarize, there is no need to recreate the wheel, study what works and find out why (such as reading a few papers on information retrieval) and then when something works, you know why and how to reproduce it.

SEO is an ever-changing endeavor, by studying the principles that support it, you can gain significant competitive advantages and apply them to a content management system, your approach towards content development or your internal or external linking practices. 

Read More Related Posts
Tiered SEO Navigation and Link Structure
In our previous post called Search Engine Optimization: Creating Tiered Structures, we touched briefly on the importance of site structure and creating cascading or faceted / tiered navigation for search ...
READ MORE
Anyone Can Pay for Traffic, but Who Can Earn it?
As you gathered from the title, the topic of traffic (paid or organic) is the subject of today's post. Let’s look at the pros and the cons of both SEO ...
READ MORE
How Does Your Website Measure Up?
Google officially updated the TPBR Page Rank last night. For those in the SEO community, the significance is met with mixed emotions ranging from dismissive to celebratory. Although page rank ...
READ MORE
Duplicate Content, What is it and How to Avoid it!
You've heard of duplicate content, but how do you know if you are creating it or how to stop it from penalizing your website? If you have a WordPress site, ...
READ MORE
Does your online marketing make dollars and sense?
Making money with your website is not as difficult as the gentlemen in this graphic depicts.  However, it does require that you understand the basic premise of relevance, positioning and ...
READ MORE
You've Got Traffic, Now What?
The natural assumption is that SEO involves just choosing a few keywords, making a few changes and presto, your website magically appears in the top 10 results (preferably in the ...
READ MORE
SEO Method#1 to Produce Keyword Stemming
Often the effectiveness of search engine optimization is predicated by keywords and SEO rankings. Since this is a primary metric for performance, I would like to share a previously guarded ...
READ MORE
SEO Marketing, Authority and Scalability
From the perspective of SEO, each domain requires a different type of tact, depending on where it is in the cycle of developing authority. However, SEO is only one channel ...
READ MORE
Rich-Media and Moving Past Traditional Marketing Mediums
Have you ever had a day where you wanted to be interrupted and watch commercials or spend countless moments being diffused from your purpose? Didn't think so, this is precisely ...
READ MORE
Finding the Proper Balance of SEO
Finding the proper SEO ratios for content, links and competing pages is critical for determining where each keyword fits in the hierarchy. This correlation between site architecture, URL, title, internal ...
READ MORE
Tiered SEO Navigation and Link Structure
Anyone Can Pay for Traffic, but Who Can
Google Toolbar PageRank Update October 2009
Duplicate Content, What is it and How to
SEO Marketing Meets Dollars and Sense
SEO is Great, But What Happens Next?
SEO Rankings and How to Create Them
SEO Marketing: Social Structure and Scalability
Advertising and Advertisers: Meet SEO and Rich Media
SEO Ratios for Content, Links and Competing Pages

About Jeffrey_Smith

In 2006, Jeffrey Smith founded SEO Design Solutions (An SEO Provider who now develops SEO Software for WordPress).

Jeffrey has actively been involved in internet marketing since 1995 and brings a wealth of collective experiences and marketing strategies to increase rankings, revenue and reach.

5 thoughts on “SEO: Search & Information Retrieval
  1. Thanks for sharing the details.It really helped me understand the SEO concept better.

Comments are closed.