Adam Vandenberg ([info]piehead) wrote,
@ 2007-02-04 10:29:00
Previous Entry  Add to memories!  Tell a Friend!  Next Entry
Entry tags:ia, urls, web

Simon Willison: Why you should be using disambiguated URLs.
It's sad that its 2007 and sites still get this wrong.

Way back in 1998 I was working on Encarta Online. At some point, the marketers came in and demanded that we add some tracking information to our search results. The first search would get an "o=1" parameter, the next "o=2" and so forth, so that we could tell how far down the search list people were actually clicking.

This after I had already spent a lot of time convincing people that the link to an article always needed to be the same. That meant cleaning up our (duplicated) URL-generating code to make sure that the query string parameters in a link to an article always came in the same order, that case mattered, that all legacy querystring parameters were stripped, etc. Why is that important? Caching!

If the characters in a URL are different, the browser won't cache the item, even if it's the "same" from the server's point of view.

When viewing an encyclopedia article you end to click around the links in an article, and there tend to be clumpy webs of articles that all link to each other. Since this is 1998 and you're on a crappy dial-up line, and maybe not even a 56k line, you want to try your best to make sure a user doesn't have to download an article twice in the same session.

The "o" parameters from the search page kill this. Not only do links between articles not have an "o" parameter, but different searches have different result orders, which yields different "o"s.

I fought, I was overruled, I gave in and stopped caring.




(Post a new comment)


[info]_fluffy
2007-02-04 07:03 pm UTC (link)
It really bothers me that this is such a big issue even in software packages written by people who should know better, like Wordpress. I really hate how many Wordpress-based weblogs have like 'blahblah/page/5' and so then there's no permanent archive link to anything anymore. Not really that big a deal for navigation, but it makes referrer-tracking lame, as well as things like on Jamglue it sends a message whenever one of your items is embedded on a page, and so now I have like a hundred 'your song has been embedded on somedomain/[tag]/[page]' messages, for several dozen combinations of [tag] and [page].

(Reply to this) (Thread)


[info]photomatt
2007-02-04 08:34 pm UTC (link)
"so then there's no permanent archive link to anything anymore."

WordPress, like all blog software I can think of, supports single-item permalinks. URLs like you refer to are really navigation, and it's at the discretion of the theme author whether they include full content or not.

(Reply to this) (Parent)(Thread)


[info]_fluffy
2007-02-04 08:45 pm UTC (link)
The fact that so many sites are broken in this way tells me that it's a default setup in the most common themes. The users of the weblog software shouldn't have to know better.

(Reply to this) (Parent)


[info]photomatt
2007-02-04 09:28 pm UTC (link)
It's no different than seeing referrers from the front page of Digg in addition to the permalink page. It's a long-standing problem on the web. It's also one of the problems Pingbacks were created to solve, by always providing a permalink-driven referrer to linked content.

(Reply to this)


Create an Account
Forgot your login or password?
Login w/ OpenID
English • Español • Deutsch • Русский…