<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Die Welt ist gar nicht so. &#187; wikimediacommons</title>
	<atom:link href="http://blog.dieweltistgarnichtso.net/tag/wikimediacommons/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.dieweltistgarnichtso.net</link>
	<description>It is entirely different.</description>
	<lastBuildDate>Mon, 23 Sep 2013 15:41:20 +0000</lastBuildDate>
	<language>de-DE</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.0.35</generator>
	<item>
		<title>WissensWert project: Open Access importer for Wikimedia Commons</title>
		<link>http://blog.dieweltistgarnichtso.net/wissenswert-projekt-open-access-importer-fur-wikimedia-commons</link>
		<comments>http://blog.dieweltistgarnichtso.net/wissenswert-projekt-open-access-importer-fur-wikimedia-commons#comments</comments>
		<pubDate>Wed, 18 Jan 2012 18:07:32 +0000</pubDate>
		<dc:creator><![CDATA[erlehmann]]></dc:creator>
				<category><![CDATA[Free Licenses]]></category>
		<category><![CDATA[On My Own Behalf]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[oaimporter]]></category>
		<category><![CDATA[openaccess]]></category>
		<category><![CDATA[wikimediacommons]]></category>
		<category><![CDATA[wissenswert2011]]></category>

		<guid isPermaLink="false">http://blog.dieweltistgarnichtso.net/?p=4458</guid>
		<description><![CDATA[Together with Daniel Mietchen and Raphael Wimmer, I will spend the coming months developing software to automatically transfer scientific content to Wikimedia Commons. The project is financially supported by Wikimedia Deutschland as part of the WissensWert competition. Details are available on the &#8230; <a href="http://blog.dieweltistgarnichtso.net/wissenswert-projekt-open-access-importer-fur-wikimedia-commons">Read more <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>
Together with <a href="http://en.wikiversity.org/wiki/User:Mietchen">Daniel Mietchen</a> and <a href="http://www.uni-regensburg.de/sprache-literatur-kultur/medieninformatik/sekretariat-team/raphael-wimmer/index.html">Raphael Wimmer</a>, I will spend the coming months developing software to <a href="http://de.wikiversity.org/wiki/Benutzer:OpenScientist/Offenes_Antragschreiben/Wissenswert_2011">automatically transfer scientific content to <i>Wikimedia Commons</i></a>. The project is <a href="http://blog.wikimedia.de/2011/12/15/wissenswert-2011-wir-gratulieren-den-fuenf-gewinnern/">financially supported by <i>Wikimedia Deutschland</i> as part of the <i>WissensWert</i> competition</a>.
</p>
<p>
<a href="http://wir.okfn.org/2012/01/18/project-introduction-open-access-media-importer-for-wikimedia-commons/">Details are available on the blog of the <i>Open Knowledge Foundation</i></a>, where I will post weekly about the project's progress.
</p>

<span id="more-4458"></span>

<blockquote cite="http://wir.okfn.org/2012/01/18/project-introduction-open-access-media-importer-for-wikimedia-commons/">
<p>
<a href="http://en.wikipedia.org/wiki/Open_access"><em>Open Access</em></a> scientific literature contains, almost by definition, content suitable – both in substance and licensing – for <a href="http://en.wikipedia.org/wiki/Wikimedia_Commons"><em>Wikimedia Commons</em></a>. However, currently, there seems to be no automated, easy way to identify such files, convert them into <a href="http://commons.wikimedia.org/wiki/Commons:Project_scope/Allowable_file_types">appropriate formats</a> and import them into <em>Commons</em>.
</p>

<p>
In November 2011, <a href="http://en.wikiversity.org/wiki/User:Mietchen">Daniel Mietchen</a> submitted <a href="http://en.wikiversity.org/wiki/User:OpenScientist/Open_grant_writing/Wissenswert_2011">a proposal</a> tackling the issue to the <a href="http://wikimedia.de/wiki/WissensWert"><em>WissensWert</em></a> funding scheme run by <a href="http://meta.wikimedia.org/w/index.php?title=Wikimedia_Deutschland/en&amp;uselang=en">the German chapter of <em>Wikimedia</em></a>. Among other projects, <a href="http://blog.wikimedia.de/2011/12/15/wissenswert-2011-wir-gratulieren-den-fuenf-gewinnern/">it was chosen to receive funding</a> (see <a href="http://wir.okfn.org/2011/12/15/supplementary-materials-to-wikimedia-commons-see-you-soon/">Daniel&#8217;s post</a>). As part of the team implementing the software envisioned, I will blog here about once a week until project conclusion.
</p>

<p>
Initially, the project will be focused on audio and video content available in <a href="http://en.wikipedia.org/wiki/PubMed_Central"><em>PubMed Central</em></a>&#8217;s <a href="http://www.ncbi.nlm.nih.gov/pmc/tools/openftlist/"><em>Open Access Subset</em></a> – however, <a href="http://en.wikiversity.org/wiki/User:OpenScientist/Open_grant_writing/Wissenswert_2011/Documentation">the toolchain is intended to be modular</a>, so other sources can be added as development continues.
</p>

<p>
The only component currently existing is a proof-of-concept <a href="https://github.com/erlehmann/open-access-media-importer/blob/master/crawler/crawler.py">crawler / downloader</a>: It downloads archives containing <abbr title="Extensible Markup Language">XML</abbr> files – each about a GiB in size – from <em>PubMed Central</em>, identifies articles referring to supplementary materials (attachments) and displays <abbr title="Uniform Resource Locator">URL</abbr>s to retrieve those.
</p>

<p>
By next week, I intend to add metadata collection – minimally author, source and licensing terms – and downloading of supplementary materials. <a href="http://www.uni-regensburg.de/sprache-literatur-kultur/medieninformatik/sekretariat-team/raphael-wimmer/index.html">Raphael Wimmer</a> also proposed an option to download only new articles, which could reduce network load by several orders of magnitude compared to the current naive implementation.
</p>

<p>
In line with the principles of <a href="https://en.wikipedia.org/wiki/Free_culture_movement"><em>free culture</em></a>, all tools will be released as <a href="http://www.gnu.org/philosophy/free-sw.html">Free Software</a>, licensed under the <a href="http://gnu.org/licenses/gpl-3.0-standalone.html"><abbr title="GNU is Not Unix">GNU</abbr> General Public License, version 3</a> (or any later version of the License published by the <a href="http://fsf.org/"><em>Free Software Foundation</em></a>).
</p>

<p>
<a href="https://github.com/erlehmann/open-access-media-importer">The source code is hosted on <em>GitHub</em>.</a>
</p>
</blockquote>]]></content:encoded>
			<wfw:commentRss>http://blog.dieweltistgarnichtso.net/wissenswert-projekt-open-access-importer-fur-wikimedia-commons/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
