<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Alec's thoughts</title>
	<atom:link href="http://www.flett.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.flett.org</link>
	<description></description>
	<lastBuildDate>Thu, 15 Oct 2009 17:08:40 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>I want to know what I should know about GMO</title>
		<link>http://www.flett.org/2009/10/15/i-want-to-know-what-i-should-know-about-gmo/</link>
		<comments>http://www.flett.org/2009/10/15/i-want-to-know-what-i-should-know-about-gmo/#comments</comments>
		<pubDate>Thu, 15 Oct 2009 17:05:24 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=66</guid>
		<description><![CDATA[I don&#8217;t know what to think at this point. I&#8217;m obviously a big organic-eating back slapping liberal like the rest of my cronies in fair Berkeley. My wife and I grow vegetables in our back yard and do our best to eat food that was grown and processed in natural, sustainable ways. We compost, we [...]]]></description>
			<content:encoded><![CDATA[<p>I don&#8217;t know what to think at this point. I&#8217;m obviously a big organic-eating back slapping liberal like the rest of my cronies in fair Berkeley. My wife and I grow vegetables in our back yard and do our best to eat food that was grown and processed in natural, sustainable ways. We compost, we recycle, and we teach our kids the importance and impact of all of these things. I watched <a href="http://www.freshthemovie.com/">Fresh</a> and <a href="http://www.foodincmovie.com/">Food, Inc.</a> this year and read <a href="http://www.amazon.com/gp/product/0060938455?ie=UTF8&amp;tag=alecflettsweb-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=0060938455">Fast Food Nation</a> years ago, nodding vigorously through all of them.</p>
<p>I&#8217;m also a big tech geek and do my best to keep up, and maybe even develop, things that help our society make the next great leap forward. I helped develop the original Netscape browser which helped the internet explode and ripped control of computer networks from Microsoft, handing it to the masses. I&#8217;m working on doing the same with open, public data right now at <a href="http://www.freebase.com/">Freebase</a>. I do believe that science and technology are bettering our society as a whole and that the risks and drawbacks far outweigh the rewards. I think they are making the world a more equitable place and giving more choices to more individuals than ever before, and I think this is a good thing.</p>
<p>So when it comes to GMO food, I&#8217;m a little confused. On the one hand, the notion of actually modifying the genetics of an organization at a cellular level seems like some kind of creepy science. On the other hand, this is just science improving the quality of life, driving down the costs of basic human sustenance. It&#8217;s just a logical extension of breeding crops for various traits, right? Some years ago I read one or two random articles (I think one was in Harpers, can&#8217;t remember what else I read) that had me thinking that on the whole, GMO food is bad. The science behind it can&#8217;t begin to address the massive complexity of our ecosystem. Further, the politics and policy behind patents on organisms, the limits that Big Agra puts on farmers for seed retention, and the notion of GMO as a way to reduce genetic diversity are really bad.</p>
<p>But with all the hubub recently about GMO + Organic and the Obama administration&#8217;s interest in the food system has given me a chance to at least try to reevaluate my position. The problem comes when I watch video&#8217;s like this Bill Nye video (In three parts: <a href="http://www.youtube.com/watch?v=Y4Cn9KqeZlw">one</a>, <a href="http://www.youtube.com/watch?v=zCNzLoUOy5g&amp;feature=PlayList&amp;p=082292F7B8A62D8F&amp;playnext=1&amp;playnext_from=PL&amp;index=28">two</a>, <a href="http://www.youtube.com/watch?v=tfutpBMUQ_8&amp;feature=PlayList&amp;p=082292F7B8A62D8F&amp;playnext=1&amp;playnext_from=PL&amp;index=29">three</a>) that I found via this <a href="http://civileats.com/2009/10/14/kitchen-table-talks-what-you-need-to-know-about-genetically-engineered-food/">Civil Eats Article on GMO food</a>. I love Bill Nye. I think he makes science really cool and fascinating and I can&#8217;t wait until my kids are old enough to watch him. But this video is incredibly biased against GMO while trying to appear like he&#8217;s showing both sides. The worst part is that most of the anti-GMO bits are either morally heavy, substance free (&#8221;But isn&#8217;t genetic modification just creepy? Should we really be messing with organisms like this?&#8221;) or just fear mongering (dramatic enactments of monster food killing people, theoretical implications that haven&#8217;t actually happened, etc)</p>
<p>One argument I&#8217;ve heard (that got Agriculture Secretary Tom Vilsack booed) is that GMOs can be used for good &#8211; to feed the world! The counter argument I&#8217;ve heard to that is that basically we have enough food, that it&#8217;s really a distribution problem &#8211; that the GMO-to-feed-the-world is a lot of bunk. What I wonder though, is if it&#8217;s really a &#8220;distribution problem&#8221; why can&#8217;t we find ways to grow food near the people that need it? The bay area has lots of self-proclaimed locavores who aspire to eat food grown within 50-150 miles from them, but why then do we need to ship food from one side of africa to the other? What if one solution to that is GMO crops that crow in climates that currently don&#8217;t support human-food agriculture? What if it would take 500 years to breed the equivalent crop?</p>
<p>So I don&#8217;t know. I think next up I&#8217;m going to watch a bunch of Long Now talks: <a href="http://www.longnow.org/seminars/02009/jul/28/organically-grown-genetically-engineered-food-future/">Organically Grown and Genetically Engineered</a>, <a href="http://www.longnow.org/seminars/02009/oct/09/rethinking-green/">Rethinking Green</a>, and Michael Pollan&#8217;s <a href="http://www.longnow.org/seminars/02009/may/05/deep-agriculture/">Deep Agriculture</a> to see if I can gain any more insight.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2009/10/15/i-want-to-know-what-i-should-know-about-gmo/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SF Food Carts, twitter, and street food</title>
		<link>http://www.flett.org/2009/08/17/sf-food-carts-twitter-and-street-food/</link>
		<comments>http://www.flett.org/2009/08/17/sf-food-carts-twitter-and-street-food/#comments</comments>
		<pubDate>Mon, 17 Aug 2009 16:25:45 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=61</guid>
		<description><![CDATA[Last week I was sitting in the public space atrium at Mission &#038; 2nd st in San Francisco huddled over my laptop trying to get some work done, and who should roll in but Carte415. Thanks to some in-the-know co-workers and twitter, I&#8217;ve been following this summer&#8217;s explosion of foodie-friendly food carts rolling around San [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I was sitting in the public space atrium at Mission &#038; 2nd st in San Francisco huddled over my laptop trying to get some work done, and who should roll in but <a href="http://carte415.com/">Carte415</a>. Thanks to some in-the-know co-workers and twitter, I&#8217;ve been following this summer&#8217;s explosion of foodie-friendly food carts rolling around San Francisco but up until that moment, I hadn&#8217;t actually seen one.</p>
<p>When I first started hearing about these carts, it was really all about word-of-twitter for finding these folks &#8211; there were few online resources and you had to just know. Personally that kind of thing drives me nuts &#8211; mostly because I wasn&#8217;t in-the-know. While there&#8217;s something exciting about knowing these little out-of-the-way places, it feels a little like a high school clique. On top of that, I have a friend who has been wanting to do what I consider a fairly original independent food thing for a while &#8211; he started an LLC, rented some kitchen space, but didn&#8217;t really get beyond the stages of trying recipes. He lost steam because he couldn&#8217;t figure out a good market for his food. </p>
<p>..and he wasn&#8217;t even aware of the whole food cart scene! So in the interest of promoting openness and transparency in the foodie scene, I present my list of favorite food carts, most of which I haven&#8217;t visited because I don&#8217;t live in the mission.</p>
<ul>
<li><a href="http://twitter.com/cremebruleecart">cremebruleecart</a> &#8211; one of the originals, he gets the top of my list because Heather and I tried and failed to find him once on a friday night (our timing was off) and Heather&#8217;s favorite dessert is Creme Brulee. Mostly around Dolores Park, but goes to lots of special events</li>
<li><a href="http://twitter.com/CARTE415">carte415</a> &#8211; Looks like fancy organic sandwiches and salads. My cheapo go-to for sandwiches is the Toaster Oven &#8211; their sandwiches are actually pretty good but I feel a little guilty going there because it&#8217;s a chain &#8211; so any chance of getting reasonably cheap organic sandwiches has got me excited. Going there today. </li>
<li><a href="http://twitter.com/chowdermobile">chowdermobile</a> &#8211;  Seems like this guy has been trying to get into SF forever. Not sure what the holdup is but I really, really want some good clam chowder&#8230; he seems to go up and down the penninsula.</li>
<li><a href="http://twitter.com/littleskillet">littleskillett</a> &#8211; ok, this isn&#8217;t a cart, and is only this low on the list because I <i>have</i> been there, a few times even. They make some pretty amazing fried chicken but the best thing I had was this crazy pile of pulled-pork and other fixins on top of grits. Plus, they have Blue Bottle coffee to prevent post-chicken food coma. The reason they fit into this list is because they serve out of a counter in an alley and you eat on the loading dock. So it&#8217;s still street food because you&#8217;re sitting on the street, quite literally.
</li>
</ul>
<p>That&#8217;s all I have for now, there are a few more I&#8217;m curious about like <a href="http://twitter.com/kitchenettesf">kitchenettesf</a>, <a href="http://twitter.com/chezspencergo">chezspencergo</a>, and <a href="http://twitter.com/SexySoupCart">SexySoupCart</a>, but they&#8217;ll have to wait until I&#8217;ve exhausted the above list. </p>
<p>(If you&#8217;re looking for more, you can look at the folks <a href="http://twitter.com/alecf">I&#8217;m</a> <a href="http://twitter.com/alecf/following">following on twitter</a>.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2009/08/17/sf-food-carts-twitter-and-street-food/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2700 San Pablo filing for bankruptcy?</title>
		<link>http://www.flett.org/2008/12/11/2700-san-pablo-filing-for-bankruptcy/</link>
		<comments>http://www.flett.org/2008/12/11/2700-san-pablo-filing-for-bankruptcy/#comments</comments>
		<pubDate>Thu, 11 Dec 2008 17:41:31 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[berkeley develpment]]></category>
		<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/2008/12/11/2700-san-pablo-filing-for-bankruptcy/</guid>
		<description><![CDATA[Check out this article in the Berkeley Daily Planet: San Pablo Condo Project Defaults, Forced Sale Scheduled. I live very close to this building, and the building is, sort of, a big improvement from the abandoned gas station that was on this lot previously. But ever since it&#8217;s been finished, all of the storefronts and [...]]]></description>
			<content:encoded><![CDATA[<p>Check out this article in the Berkeley Daily Planet: <a href="http://www.berkeleydailyplanet.com/issue/2008-12-04/article/31720?headline=San-Pablo-Condo-Project-Defaults-Forced-Sale-Scheduled">San Pablo Condo Project Defaults, Forced Sale Scheduled</a>. I live very close to this building, and the building is, sort of, a big improvement from the abandoned gas station that was on this lot previously. But ever since it&#8217;s been finished, all of the storefronts and street-level live/work spaces have been unoccupied.<br />
About a year ago this building was being finished, and 3 more projects were in the works: <a href="http://www.ci.berkeley.ca.us/contentdisplay.aspx?id=780">2747 San Pablo</a> (a huge 40+ unit building where there is already a business), <a href="http://www.ci.berkeley.ca.us/ContentDisplay.aspx?id=22310">2748 San Pablo</a> (a 20-23 unit building where Clay of the Land used to be) and another one adjacent to the one at 2700 San Pablo (I can&#8217;t find the link now, where there used to be a car dealership)</p>
<p>Now all these projects seem to be on hold. New businesses have sprung up in the latter lots using the existing buildings. I must admit I&#8217;m pretty disappointed with how the whole thing turned out.</p>
<p>I am actually a bit in favor of some actual development along this corridor because the vacant lots and failing businesses were of questionable value to the neighborhood. (The used care dealership was just full of broken down cars, I always wondered who actually went there to buy a car, because the cars never seemed to change) But with this housing downturn we may be heading back where it was before&#8230;</p>
<p>And now I just found this: <a href="http://www.berkeleydailyplanet.com/issue/2008-12-11/article/31798?headline=San-Pablo-Condos-Top-ZAB-s-Agenda">San Pablo Condos Top ZAB’s Agenda</a> &#8211; looks like some nearby projects are still going forward. Huh.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2008/12/11/2700-san-pablo-filing-for-bankruptcy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tramp mode rocks</title>
		<link>http://www.flett.org/2008/12/11/tramp-mode-rocks/</link>
		<comments>http://www.flett.org/2008/12/11/tramp-mode-rocks/#comments</comments>
		<pubDate>Thu, 11 Dec 2008 17:27:06 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[geeking-out]]></category>
		<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/2008/12/11/tramp-mode-rocks/</guid>
		<description><![CDATA[ok, it&#8217;s been a while since my last post, but I&#8217;m going to try starting up again.
I just want to rave about &#8220;tramp mode&#8221; in emacs. If you haven&#8217;t yet seen this, it allows you to load up files from a machine that you have ssh access to. Accessing it is super-easy. Rather than C-x [...]]]></description>
			<content:encoded><![CDATA[<p>ok, it&#8217;s been a while since my last post, but I&#8217;m going to try starting up again.</p>
<p>I just want to rave about &#8220;tramp mode&#8221; in emacs. If you haven&#8217;t yet seen this, it allows you to load up files from a machine that you have ssh access to. Accessing it is super-easy. Rather than C-x C-f to load a local file path, just enter the file path as ssh://userid@host:/path/</p>
<p>After that everything you save will be saved over ssh/scp. Brilliant.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2008/12/11/tramp-mode-rocks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Welcome back..</title>
		<link>http://www.flett.org/2007/03/20/welcome-back/</link>
		<comments>http://www.flett.org/2007/03/20/welcome-back/#comments</comments>
		<pubDate>Tue, 20 Mar 2007 20:21:15 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/2007/03/20/welcome-back/</guid>
		<description><![CDATA[Ok, so it&#8217;s been well over a year since I last updated this blog. I&#8217;ve had numerous things to say, but the ideas always come to me on the bus, or in the shower, or somewhere else where I don&#8217;t have access to a keyboard. I&#8217;m going to once again try to revitalize this blog [...]]]></description>
			<content:encoded><![CDATA[<p>Ok, so it&#8217;s been well over a year since I last updated this blog. I&#8217;ve had numerous things to say, but the ideas always come to me on the bus, or in the shower, or somewhere else where I don&#8217;t have access to a keyboard. I&#8217;m going to once again try to revitalize this blog with some actual comments and insights. First up, I&#8217;ve got an entry about development in Berkeley.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2007/03/20/welcome-back/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Why NOT to eat organic</title>
		<link>http://www.flett.org/2005/10/05/why-not-to-eat-organic/</link>
		<comments>http://www.flett.org/2005/10/05/why-not-to-eat-organic/#comments</comments>
		<pubDate>Wed, 05 Oct 2005 23:04:37 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[advocacy]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=51</guid>
		<description><![CDATA[A while back I was exercising my writing, trying to find an voice for this blog, and wrote Why to shop organic. A friend of mine recently gave me a hard time about it and through a funny confluence of events, I found two reasons not to eat organic.
Reason number one: Probably the reason Heather [...]]]></description>
			<content:encoded><![CDATA[<p>A while back I was exercising my writing, trying to find an voice for this blog, and wrote <a href="http://www.flett.org/2003/05/18/why-to-shop-organic/">Why to shop organic</a>. A friend of mine recently gave me a hard time about it and through a funny confluence of events, I found two reasons <em>not</em> to eat organic.</p>
<p>Reason number one: Probably the reason Heather used to call organic strawberries &#8220;armpit fruit&#8221;:<br />
<a href="javascript:void(window.open('/wp-content/worm.jpg','width=640,height=480'))"><br />
<img src='/wp-content/thumb-worm.jpg' alt='Artichoke' /></a><br />
Yes, that is a dead worm in my artichoke. Yes, I had to eat this far to discover it. <img src='http://www.flett.org/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Reason number two: Sure they don&#8217;t use pesticides, but I don&#8217;t want babies working the fields any more than I want 12 year olds making my shirts.</p>
<p><a href="javascript:void(window.open('/wp-content/earthgrains.jpg','width=800,height=600'))"><br />
<img src='/wp-content/thumb-earthgrains.jpg' alt='Organic Rice Cereal Box' /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2005/10/05/why-not-to-eat-organic/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Andy Rooney on Iraq</title>
		<link>http://www.flett.org/2005/10/03/andy-rooney-on-iraq/</link>
		<comments>http://www.flett.org/2005/10/03/andy-rooney-on-iraq/#comments</comments>
		<pubDate>Mon, 03 Oct 2005 16:31:54 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[advocacy]]></category>

		<guid isPermaLink="false">http://www.flett.org/2005/10/03/andy-rooney-on-iraq/</guid>
		<description><![CDATA[I never thought I&#8217;d be sending around something that took Andy Rooney seriously, but this morning I ran into a post on BoingBoing that blew me away. Last night Andy Rooney&#8217;s segment on 60 minutes (BitTorrent link) blasted the Iraq effort in a way that I think much of Middle America can understand: basic facts. [...]]]></description>
			<content:encoded><![CDATA[<p>I never thought I&#8217;d be sending around something that took Andy Rooney seriously, but this morning I ran into <a href="http://www.boingboing.net/2005/10/03/andy_rooney_has_a_po.html">a post on BoingBoing</a> that blew me away. Last night <a href="http://www.wakahiru-me.com/media/vid/cbs/cbs_60min_andy_rooney_iraq_war_051002a.mov">Andy Rooney&#8217;s segment</a> on 60 minutes (<a href="http://torrent.crooksandliars.com/60%20Minutes-AR-cost-of-the-war-10-2.mov.torrent">BitTorrent link</a>) blasted the Iraq effort in a way that I think much of Middle America can understand: basic facts. (Also see the <a href="http://www.cbsnews.com/stories/2005/09/30/60minutes/main892398.shtml">transcript</a>. </p>
<p>I have a theory that many more people would be against the Iraq war and more critical of the Whitehouse administration if they simply understood the implications for this country. For example, I wonder how many people know that our budget this year for defense is $336 billion, yet our educational budget is $61 billion? I wonder how many people would support the simplest proposal of say, cutting $30 billion from the defense budget in order to increase the education budget by a whopping 50%?</p>
<p>And so I can&#8217;t begin to express how pleased I am that someone like Andy Rooney, who is typically viewed as fairly harmless, suddenly has become so vocally critical of the war. I think the mainstream media finally got some backbone with their outrage over the handling of Katrina, but I&#8217;m going to predict that Andy Rooney&#8217;s segment yesterday is a turning point for public criticism of the war and this administration. I think this changes the face of opposition. I think for many people it all sounds like the just the rantings of some that crazy mom Cindy Sheehan, or some crazy Californians who are too disconnected from the real world to have a legitimate voice, or some vocal celebrities jumping on the bandwagon of rebelliousness.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2005/10/03/andy-rooney-on-iraq/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
<enclosure url="http://www.wakahiru-me.com/media/vid/cbs/cbs_60min_andy_rooney_iraq_war_051002a.mov" length="4784619" type="video/quicktime" />
		</item>
		<item>
		<title>Building a graph-based model of metadata</title>
		<link>http://www.flett.org/2005/08/03/building-a-graph-based-model-of-metadata/</link>
		<comments>http://www.flett.org/2005/08/03/building-a-graph-based-model-of-metadata/#comments</comments>
		<pubDate>Wed, 03 Aug 2005 15:47:46 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=50</guid>
		<description><![CDATA[I have had some success building an in-memory graph of  my iTunes database, in Python. I discovered some rather interesting things about my collection in the process and I&#8217;ve started thinking about a way to use this information to cleanly chunk the data.
In my graph, nodes are represented by Python tuples that refer to [...]]]></description>
			<content:encoded><![CDATA[<p>I have had some success building an in-memory graph of  my iTunes database, in Python. I discovered some rather interesting things about my collection in the process and I&#8217;ve started thinking about a way to use this information to cleanly chunk the data.</p>
<p>In my graph, nodes are represented by Python tuples that refer to the metadata culled from the song list. For example, there is a node for (&#8217;Artist&#8217;, &#8216;U2&#8242;) and another for (&#8217;Genre&#8217;, &#8216;Rock&#8217;). I keep track of the relationship between these nodes with a weight that comes from the number of songs that have both of these pieces of metadata.</p>
<p>So for example there is a line between (&#8217;Artist&#8217;, &#8216;U2&#8242;) and (&#8217;Genre&#8217;, &#8216;Rock&#8217;) which has a weight of 15, because their new album is categorized as &#8216;Rock&#8217; &#8211; though songs from the album October are categorized as &#8216;Rock/Pop&#8217;</p>
<p>When I combine all the different pieces of metadata in my collection I get a whopping 1589 different facets, represented by nodes in my graph. But whats more interesting is that about 1500 of these nodes are connected, and the other 90 or so are divided into about 30 different individual chunks of 3-4 facets each. I tried to visualize this with <a href="http://www.graphviz.org/">GraphViz</a> but the data was just too big.</p>
<p>But this got me thinking more about how to chunk the graph. It was really surprising that so many of the nodes were connected, but really what matters to me is knowing which nodes are the <em>most</em> connected. This means that I could start dropping lines (connections) between nodes where the weight is just 1&#8230; or 2, or whatever number yields an appropriately chunked graph. Hopefully that will break up the large cluster of facets into smaller, more usable clusters.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2005/08/03/building-a-graph-based-model-of-metadata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A graph based model for chunking</title>
		<link>http://www.flett.org/2005/08/01/a-graph-based-model-for-chunking/</link>
		<comments>http://www.flett.org/2005/08/01/a-graph-based-model-for-chunking/#comments</comments>
		<pubDate>Mon, 01 Aug 2005 16:31:48 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=49</guid>
		<description><![CDATA[Factor Analysis seems very promising, but I was thinking a lot about a presentation given by Mimi Yin at OSAF. In particular the Venn diagrams which showed items as existing in a number of collections based on the attributes of the item. These collections may or may not really exist in real life, but their [...]]]></description>
			<content:encoded><![CDATA[<p>Factor Analysis seems very promising, but I was thinking a lot about a <a href="http://wiki.osafoundation.org/bin/view/Journal/VirtualityPresentationImages">presentation </a>given by Mimi Yin at OSAF. In particular the Venn diagrams which showed items as existing in a number of collections based on the attributes of the item. These collections may or may not really exist in real life, but their virtual existence is important.<br />
<span id="more-49"></span><br />
For example, in my iTunes collection I might have a bunch of music by U2. While the songs (or mp3 files) themselves may be distributed anywhere in my music collection, they belong to a virtual collection of songs that all have the &#8220;Artist&#8221; attribute equal to &#8220;U2&#8243;. Some of these songs may exist in a virtual collection where &#8220;Album&#8221; is &#8220;Unforgettable Fire&#8221; and some of them may be in the &#8220;Genre&#8221; &#8220;Rock&#8221;</p>
<p>In the presentation, these virtual collections were presented with  colored regions, and the items themselves were little dots that exist in multiple regions. I think what could be potentially interesting is the way that these regions overlap because of the songs that link them.</p>
<p>So I am developing a graph based model, a very simple one really, where each vertex is a particular value, or facet, such as &#8220;Artist=U2&#8243; and each line between two verticies represents the songs that exist in both collections. So along each line is a set of actual songs, and the verticies themselves exist only in the virtual sense. The lines can be assigned a particular weight based on the songs that it represents. A simple weight would simply be the number of songs  on the line itself.</p>
<p>By connecting all this information I believe what we&#8217;ll come up with is a fairly well connected graph, but with a great varition in line weights. This variation isn&#8217;t random and the patterns that develop will correlate with clusters in the graph.</p>
<p>It may be obvious by now that this is really just a graph-based representation of the correlation matrix. I&#8217;m making a wild-ass assumption that Factor Analysis doesn&#8217;t deal well with large numbers of factors, but perhaps some graph-walking algorithms can at least reduce the graph cluster at a time? Time to dust off my old Algorithms text book from college&#8230; <img src='http://www.flett.org/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2005/08/01/a-graph-based-model-for-chunking/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>An exploration: Chunking using Factor Analysis</title>
		<link>http://www.flett.org/2005/07/22/an-exploration-chunking-using-factor-analysis/</link>
		<comments>http://www.flett.org/2005/07/22/an-exploration-chunking-using-factor-analysis/#comments</comments>
		<pubDate>Fri, 22 Jul 2005 18:30:50 +0000</pubDate>
		<dc:creator>alecf</dc:creator>
				<category><![CDATA[projects]]></category>

		<guid isPermaLink="false">http://www.flett.org/?p=48</guid>
		<description><![CDATA[I&#8217;ve been developing my ideas about chunking as I&#8217;ve been writing. My faith that there is structure expressed by facets keeps me believing that there is a way to extract this structure.
Last year I read (most of) The Mismeasure of Man by Stephen J Gould. Aside from being a fantastic book, its last chapter on [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been developing my ideas about chunking as I&#8217;ve been writing. My faith that there is structure expressed by facets keeps me believing that there is a way to extract this structure.</p>
<p>Last year I read (most of) <a href="http://www.amazon.com/exec/obidos/ASIN/0393314251/alecflettsweb-20?dev-t=D1II307N4QU78O%26camp=2025%26link_code=xm2">The Mismeasure of Man</a> by Stephen J Gould. Aside from being a fantastic book, its last chapter on Factor Analysis has been floating around in my head for quite some time. I think this could be one way to extract the kind of chunks I am looking for.<br />
<span id="more-48"></span></p>
<h3>Factor Analysis, as I understand it</h3>
<p>Briefly, Factor Analysis is a way of taking long lists of data, usually multiple datapoints of multiple items, and trying to figure out how many factors are really at play. As Gould says, &#8220;factor analysis simplifies large sets of data by reducing dimensionality and trading some loss of information for the recognition of ordered structure in fewer dimensions.&#8221;</p>
<p>Here&#8217;s an example of how this might work in a biological study. Measurements of 10 different bones in 50 different members of a species are taken. You generate a correleation matrix of each of the 10 measurements, so you have a 10&#215;10 matrix. Each measurement is perfectly correlated with itself of course so the diagonal is 1.0. Then you can actually try to factor the matrix. What this does is reduce the number of dimensions (factors) that can predict all of the other measurements with a reasonable degree of accuracy.</p>
<p>In a simple case, you might find that all bones are consistently about the same proportional length in each creature. The femur is consistenly about 10% longer than the tibia, so the tibia could be used as a ruler for all other bones. Thus, there might be only one factor, growth, which is determining the size of all 10 bones. That factor, which is a number, can be used to predict the length of all 10 bones with some high degree of accuracy.</p>
<p>In a more complex case, you might find that there are really 2 factors at play, and that the measurement of those 10 bones is a function of those two factors. The femur length might be 1.1 x tibia length times 1.03 x fibula. This means there are independent factors contributing to the length of the tibia and the fibula, but given these two measures, you can predict other bone lengths.</p>
<p>In both of these cases, if you don&#8217;t care about absolute accuracy, you no longer have to keep the 10 measurements of all the bones when describing a creature. You could just refer to its size as it relates to the tibia, or the tibia and the fibula.</p>
<h3>So how does this apply in the world of metadata?</h3>
<p>If you imagine that instead of 50 creatures, we have 10,000 songs. Each of those songs has some amount of metadata, or properites, associated with it. </p>
<p>Now most of this metadata is not numeric, so its hard to compare the value of one Artist (&#8221;U2&#8243;) to another (&#8221;Suzanne Vega&#8221;). Whats more important is to determine a value that can be used for correlation between any two bits of metadata. For instance, if U2 and Suzanne Vega appear on an album together, then they are pretty closely correlated. If U2 and Coldplay are in the same genre, they may also be closely correlated. There are lots of possibilities &#8211; if two albums came out in the same year, if two artists both covered the same song, if two genres have songs by the same artist, and so forth. </p>
<p>So really what you end up with is a correlation matrix between all combinations of metadata. i.e. &#8220;Album: Unforgettable Fire&#8221; and &#8220;Genre: Hip Hop&#8221; are just two &#8220;values&#8221; or columns in the correlation matrix.</p>
<p>Looking at my iTunes collection, I see that I have 77 Genres, 623 artists, and 743 albums. All told that&#8217;s a correlation matrix 1443&#215;1443. Wow, that&#8217;s a big matrix. Lets hope Factor Analysis can be used on such huge datasets!</p>
<p>So what does it mean to factor such a matrix? If you imagine that your data is not evenly distributed within each of the metadata categories (i.e. you might have more u2 than anyone) then what you have to imagine is that each of these clusters have a few primary themes running through them. As I understand Factor Analysis, what we should end up with is the sort of &#8216;hubs&#8217; within clusters. </p>
<p>Factor Analysis is typically used to find a &#8220;principle component&#8221; &#8211; a primary dimension that can often determine much of the rest of the dataset. This primary component can be measured by checking how many of the vectors in the matrix project well onto this primary component. So for many biological systems, you might find that the principle component describes some large portion of the information recorded, and thus its not necessary to find other components.</p>
<p>In the case of information stored in iTunes, I&#8217;m guessing that the principle component will only weakly describe a set of the data. Instead of describing some 90%, or even 50% of the correlations in the database, I&#8217;ll bet the &#8220;principle component&#8221; is describes less than 10% of correlations well. So if your principle component doesn&#8217;t describe much, you want a secondary component. In factor analysis, all components are perpendicular to each other. What I&#8217;m hoping in the case of iTunes is that this means that if my principle component is say &#8220;Artist: U2&#8243;, then my secondary component might be something totally unrelated like &#8220;Genre: Hip Hop&#8221; (And part of me secretly wonders if all the components are going to boil down to Genres, which might be sad)</p>
<p>So I think I have the tools to generate a correlation matrix, but the question is whether I have the tools to turn that matrix into a set of useful factors?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.flett.org/2005/07/22/an-exploration-chunking-using-factor-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
