Information Philosophy

...now browsing by category

What are information and data? How do they represent what they represent? What are we doing when we record and use information? These questions are philosophical, but also deeply pragmatic. This category collects posts that deal more with this sort of question than with technical issues.

 

The two cultures

Wednesday, June 24th, 2009

Jon Stokes has an excellent description of the two contrasting philosophies of information management in his comparison of the Palm Pre and the iPhone.

He names the two approaches “structure-and-browse” and “collect-and-query”. I feel like I’ve been groping for these terse descriptions for years!

OpenStreetMap constructs maps from GPS tracks!

Wednesday, February 21st, 2007

Sources and uses of digital information are in-scope for this blog, and a great example just showed up in my RSS reader today.

OpenStreetMap is a wiki-like project to build a world map using contributed GPS tracks [OpenGeoData pointed me there]. Their map of Baghdad is here.

This project is truly a product of the early 21st century: it requires GPS satellites, cheap but accurate GPS receivers, the World Wide Web, inexpensive computers with fast color graphics, and so forth.

And like all modern geographic applications, it also exploits a special property of GPS’s information domain: everyone agrees on the meaning of geographical location; only dates and times have a similar level of standardization. In relational-database terminology, this means that any table with a date or location column has a meaningful join with any other.

This doesn’t work with most data. I’ve had driver’s licenses in four U.S. states, but you can’t aggregate my driving record from the state records because they all use different ID numbering schemes (nice for my privacy in this case).

Also noteworthy is the fact that GPS information can be used to put a time dimension into maps, since we can tell when the street is used as well as where it is. There are some very pretty examples at Cabspotting.

Information Patterns: series introduction

Friday, February 2nd, 2007

Every time a new data format spec hits my inbox, I get a little twinge of dread.

Such documents are often enormous. They’re written in standardese (often badly). They’re usually written by committees. They go through a maze of twisty little revisions, all different.

But worst of all, they often bury their novelty in a sea of details that resemble those in the last spec I reviewed.

I’d like to do for data formats and other information representations what the Gang of Four book does for programs: call out and label the patterns that come up over and over again so that I can classify details into bigger chunks for mental processing.

You can expect to see several different kinds of post in this series:

  • Case studies. I have to look at lots of actual data formats in order to discern the patterns!
  • Data format patterns. Most posts will be about patterns I find in data formats…
  • Information usage patterns. …but some posts will be about how information is generated, stored, and used.
  • Other. I’ll probably think of some other topics as well.

I expect to look at simple cases, such as comma-separated values, as well as fiendishly complex cases, such as PDF. Programming-language syntaxes are fair game; database index disk structures are right out. In between, I’ll draw the boundary as interest dictates.

This series will be open-ended as long as people keep inventing data formats faster than I can look at them.

Kent’s Data and Reality [book pointer]

Thursday, December 28th, 2006

Let me kick off this blog by pointing to William Kent’s classic book Data and Reality.

Lots of books will teach you how to process data with particular technologies, but Kent’s book goes deeper. He shows in chapter after chapter how database practice fails to match the way humans actually use information.

Data and Reality is almost thirty years old, but the issues haven’t really changed: if anything, they’re much more in our collective faces.

This book may be for you if:

  • you feel strongly that you and your Social Security number (U.S. tax identification) are not the same thing,
  • you wonder whether Mark Twain and Samuel Clemens were the same person for all purposes,
  • you don’t know what to put down for Homer’s year of birth in that author/title cataloguing app you downloaded,
  • you wonder about people who think that something doesn’t exist if it’s not in the expected database (or if it’s not on the Web).

Information philosophy

If these issues sound a lot like the first ten minutes of a college philosophy course, that’s intended. Philosophy is all about seeking answers to questions we don’t often pause to ask.

Pause.

Pause.

We run into questions like these all the time in building software, especially now that we’ve woven the Web and woven ourselves and our lives into it.

On Information in Rotation I’m going to call this category “Information Philosophy”; I think it will get woven in with the more orthodox techy blog-fodder as we go along.

Anyhow, I strongly recommend Data and Reality, which is available as print-on-demand or (inexpensive) eBook from the publisher at the link I gave above (as of 2006-12-28).

[The paperback has ISBN 9781585009701 and the eBook 9781420898880. Does this make them different books?]