<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Information in Rotation &#187; Areas of application</title>
	<atom:link href="http://appliedrotation.com/Techblog/?cat=3&#038;feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://appliedrotation.com/Techblog</link>
	<description>Dan Rabin writes on metadata, data, the information they represent and how.</description>
	<lastBuildDate>Sun, 01 Nov 2015 20:21:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
		<item>
		<title>Personal Geotagging: Data wrangling</title>
		<link>http://appliedrotation.com/Techblog/?p=153</link>
		<comments>http://appliedrotation.com/Techblog/?p=153#comments</comments>
		<pubDate>Sun, 01 Nov 2015 20:21:20 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Geotagging]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=153</guid>
		<description><![CDATA[I recently learned that the process of cleaning datasets so that they can really be used is called &#8220;data wrangling&#8221;. At first, I thought that the main data wrangling task in personal geotagging was going to be cleaning the GPS &#8230; <a href="http://appliedrotation.com/Techblog/?p=153">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I recently learned that the process of cleaning datasets so that they can really be used is called &#8220;data wrangling&#8221;.  </p>
<p>At first, I thought that the main data wrangling task in personal geotagging was going to be cleaning the GPS data.  Sometimes the GPS is too coarse, sometimes it&#8217;s off because not enough satellites were in view, sometimes the GPS was just not turned on when I took the photo.  </p>
<p>But, in the course of resetting my cameras&#8217; clocks to standard time today, I realized that I&#8217;m going to have to wrangle the timestamps on the photos as well.  I had thought that one-hour increments due to daylight savings and time-zone travel would be the main problem, but now I see that the minutes matter, too.  Camera clocks don&#8217;t sync with the network or with GPS (at least not on my relatively ancient cameras), and I can move a significant distance in the three or seven minutes by which the clock has drifted away from GPS time.  </p>
<p>I&#8217;m going to have to be able to apply short time offsets to the timestamps of all photos taken by a given camera on a given day.  Setting the correction is going to require precisely recognizing at least one key location in each batch of photos in order to compute the offset.  </p>
<p>That&#8217;s for past photos.  For the future, I can get in the habit of either correcting the clock frequently or of snapping a reference shot with my cellphone along with each set of camera shots.</p>
<p>Space-time is tricky.</p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=153</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal geotagging: note 1</title>
		<link>http://appliedrotation.com/Techblog/?p=123</link>
		<comments>http://appliedrotation.com/Techblog/?p=123#comments</comments>
		<pubDate>Thu, 24 Sep 2015 05:06:34 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Geographic]]></category>
		<category><![CDATA[Geotagging]]></category>
		<category><![CDATA[Personal information]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=123</guid>
		<description><![CDATA[The common item that lets you find a photo&#8217;s location in a GPS track is the time it was taken. In database terminology, time is the join column. Since I&#8217;m starting with photos that are already timestamped, I want to &#8230; <a href="http://appliedrotation.com/Techblog/?p=123">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The common item that lets you find a photo&#8217;s location in a GPS track is the time it was taken.  In database terminology, time is the join column.</p>
<p>Since I&#8217;m starting with photos that are already timestamped, I want to be able to look up their timestamps in some sort of GPS database to see whether there&#8217;s a recorded time/location event near that time.  Unfortunately, my initial perusal of a couple of geo-extensions to well known databases (SpatialLIte, PostGIS), show a generous helping of geometry-oriented query capabilities but no obvious time-query capabilities.  </p>
<p>It could be that time-indexing is so simple compared with spatial indexing that what I&#8217;m looking for is just swamped in the documentation, or it could be that I&#8217;ll have to dig some more.  </p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=123</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal data: geotagging photos</title>
		<link>http://appliedrotation.com/Techblog/?p=120</link>
		<comments>http://appliedrotation.com/Techblog/?p=120#comments</comments>
		<pubDate>Wed, 02 Sep 2015 06:54:20 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Geographic]]></category>
		<category><![CDATA[Geotagging]]></category>
		<category><![CDATA[Personal information]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=120</guid>
		<description><![CDATA[I have over 50,000 photos I&#8217;d like to tag with the locations they were taken. In a perfect world, this would be a relatively simple matter of looking up each photo&#8217;s creation date and time, finding that date and time &#8230; <a href="http://appliedrotation.com/Techblog/?p=120">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I have over 50,000 photos I&#8217;d like to tag with the locations they were taken.  </p>
<p>In a perfect world, this would be a relatively simple matter of looking up each photo&#8217;s creation date and time, finding that date and time in my years of GPS logs, and associating my GPS location for that date and time with the photo.</p>
<p>The world isn&#8217;t perfect.  I took a lot of photos before I got a GPS device.  Sometimes the device&#8217;s batteries give out or I forget to turn it on.  There may be time zone and daylight-savings issues that make nonsense of the date-and-time link between the photo and the location, and I need to find and fix them.</p>
<p>Even if I manage to bring perfection to my existing data sets, I had better design a smooth-enough workflow for my future photos that I&#8217;ll actually keep up, and of course I&#8217;ll need to correct future mistakes and omissions.</p>
<p>Further, tagging the photos really becomes useful if I put them on maps, so I need a tool to make located map icons that point to the photos.  I already tag my photos with broad categories, and I&#8217;d like to be able to vary the icon according to the category, or to sort the categories into separate map layers.</p>
<p>These are the basic goals.  I&#8217;ll be blogging more as I design and build this thing.</p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=120</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Genome data at NCBI</title>
		<link>http://appliedrotation.com/Techblog/?p=71</link>
		<comments>http://appliedrotation.com/Techblog/?p=71#comments</comments>
		<pubDate>Sun, 13 Sep 2009 23:27:47 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Areas of application]]></category>
		<category><![CDATA[Biology]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=71</guid>
		<description><![CDATA[The National Center for Biotechnology Information (U.S.) has a nice online viewer for the genomes of many organisms, including Homo sapiens. The human genome has just over 3 billion base pairs in about 25 thousand genes. This is a large &#8230; <a href="http://appliedrotation.com/Techblog/?p=71">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.ncbi.nlm.nih.gov/">National Center for Biotechnology Information</a> (U.S.) has a nice <a href="http://www.ncbi.nlm.nih.gov/Genomes/">online viewer</a> for the genomes of many organisms, including <it>Homo sapiens</it>.</p>
<p>The human genome has just over 3 billion base pairs in about 25 thousand genes.  This is a large enough data set that it gets algorithms of its own.</p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=71</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Chris Anderson: One size metadata doesn&#8217;t fit all</title>
		<link>http://appliedrotation.com/Techblog/?p=30</link>
		<comments>http://appliedrotation.com/Techblog/?p=30#comments</comments>
		<pubDate>Wed, 21 Mar 2007 17:04:23 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Areas of application]]></category>
		<category><![CDATA[Information Philosophy]]></category>
		<category><![CDATA[Information usage patterns]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Musical]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=30</guid>
		<description><![CDATA[Misfits of Metadata Chris Anderson of The Long Tail has an important post about how the metadata used in some music-listening applications doesn&#8217;t satisfy the listeners needs: [...] classical is a genre that the one-size-fits-all music aggregators such as iTunes &#8230; <a href="http://appliedrotation.com/Techblog/?p=30">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<h3>Misfits of Metadata</h3>
<p>Chris Anderson of <a href="http://www.longtail.com/the_long_tail/" title="The Long Tail">The Long Tail</a> has an<a href="http://www.longtail.com/the_long_tail/2007/03/one_size_aggreg.html" title="One size metadata doesn't fit all"> important post</a> about how the metadata used in some music-listening applications doesn&#8217;t satisfy the listeners needs:</p>
<blockquote><p>[...] classical is a genre that the one-size-fits-all music aggregators such as iTunes don&#8217;t handle particularly well. They&#8217;re oriented around pop music, with its artist, album, track data format. Meanwhile classical music organizes around composer, conductor, performer, soloist</p></blockquote>
<p>He also voices my exact peeve about how jazz is treated:</p>
<blockquote><p>However, neither of them does a very good job with Jazz, where the individual musicians are often more meaningful than the band.</p></blockquote>
<p>Yup.  No reasonable cataloguer of jazz recordings separates &#8220;Thelonious Monk Trio&#8221; from &#8220;Thelonious Monk Quartet&#8221; from &#8220;Thelonious Monk&#8221;.  At the same time, it&#8217;s important to be able to locate all appearances of Thelonious Monk, regardless of whether he was the leader of the session (note that &#8220;leader&#8221; and &#8220;session&#8221; are appropriate terms in jazz discography, but not for pop or classical).
</p>
<h3>When your only tool is a hammer&#8230;</h3>
<p>I can&#8217;t help but wonder if the problems Chris calls out in iTunes come from the poor selection of data tools in most applications programmers&#8217; toolkits.  Relational databases, the current orthodox storage technique, favor using one or more tables, each consisting of records having the same selection of attributes.  There are hacks you can use to simulate having, say, jazz tracks and pop tracks in the same Tracks relation, but hacks and simulations tend to twist one&#8217;s code, so most programmers resist going there.
</p>
<h3>An XML database in every toolbox!</h3>
<p>
We don&#8217;t really have to live this way anymore.  With the popularity of <a href="http://www.w3.org/XML/">XML</a> for data interchange, the tools ecology has given us a <a href="http://www.rpbourret.com/xml/XMLDatabaseProds.htm">variety</a> of <a href="http://www.rpbourret.com/index.htm">XML database systems</a>.  The XML data model has the flexibility to represent varying record structures: in fact, it has much more flexibility than we need for the purpose!</p>
<p>Heretical as it may seem to put the cart of an interchange format before the horse of data abstraction, the XML situation is very useful in practice, at least for databases of moderate size.  The <a href="http://www.w3.org/%20">W3C</a> has come up with the <a href="http://www.w3.org/Style/XSL/">XPath</a> and <a href="http://www.w3.org/XML/Query/">XML Query</a> specifications that provide excellent query mechanisms for data represented in the XML model.  XML Query in particular is designed to look somewhat familiar to the hardened <a href="http://www.jcc.com/sql.htm">SQL</a> user.  There&#8217;s data typing taken from the <a href="http://www.w3.org/XML/Schema">XML Schema</a> <a href="http://www.w3.org/TR/xmlschema-2/">datatype recommendataion</a> as well.</p>
<h3>Better nails</h3>
<p>Anyhow, let&#8217;s learn to design with a more flexible hammer, and maybe we&#8217;ll be able to hit a wider class of nails, rather than our users&#8217; thumbs!</p>
<p><em>March is International Runaway Metaphor Month.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=30</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenStreetMap constructs maps from GPS tracks!</title>
		<link>http://appliedrotation.com/Techblog/?p=29</link>
		<comments>http://appliedrotation.com/Techblog/?p=29#comments</comments>
		<pubDate>Wed, 21 Feb 2007 17:06:04 +0000</pubDate>
		<dc:creator>Dan Rabin</dc:creator>
				<category><![CDATA[Areas of application]]></category>
		<category><![CDATA[Geographic]]></category>
		<category><![CDATA[Information Philosophy]]></category>
		<category><![CDATA[Information usage patterns]]></category>

		<guid isPermaLink="false">http://appliedrotation.com/Techblog/?p=29</guid>
		<description><![CDATA[Sources and uses of digital information are in-scope for this blog, and a great example just showed up in my RSS reader today. OpenStreetMap is a wiki-like project to build a world map using contributed GPS tracks [OpenGeoData pointed me &#8230; <a href="http://appliedrotation.com/Techblog/?p=29">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Sources and uses of digital information are in-scope for this blog, and a great example just showed up in my RSS reader today.</p>
<p><a href="http://wiki.openstreetmap.org/index.php/Main_Page" title="OpenStreetMap home page">OpenStreetMap</a> is a wiki-like project to build a world map using contributed GPS tracks [<a href="http://www.opengeodata.org/?p=167">OpenGeoData</a> pointed me there].  Their map of Baghdad is <a href="http://wiki.openstreetmap.org/index.php/Image:Baghdad.png">here</a>.  </p>
<p>This project is truly a product of the early 21st century: it requires GPS satellites, cheap but accurate GPS receivers, the World Wide Web, inexpensive computers with fast color graphics, and so forth.</p>
<p>And like all modern geographic applications, it also exploits a special property of GPS&#8217;s information domain: everyone agrees on the meaning of geographical location; only dates and times have a similar level of standardization.  In relational-database terminology, this means that any table with a date or location column has a meaningful join with any other.  </p>
<p>This doesn&#8217;t work with most data.  I&#8217;ve had driver&#8217;s licenses in four U.S. states, but you can&#8217;t aggregate my driving record from the state records because they all use different ID numbering schemes (nice for my privacy in this case).</p>
<p>Also noteworthy is the fact that GPS information can be used to put a time dimension into maps, since we can tell <em>when</em> the street is used as well as <em>where</em> it is.  There are some very pretty examples at <a href="http://cabspotting.org/timelapse.html">Cabspotting</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://appliedrotation.com/Techblog/?feed=rss2&#038;p=29</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
