Skip to Content

Urania Berlin, June 6-7, 2011


Andrzej Białecki

Apache Lucene PMC Member

Andrzej Bialecki, Apache Lucene PMC Member, also serves as the project lead for Nutch, and as committer in the Lucene-java, Nutch and Hadoop projects. He has broad expertise, across domains as diverse as information retrieval, systems architecture, embedded systems, networking and business process/e-commerce modeling. He's also the author of the popular Luke index inspection utility. Andrzej holds a master's degree in Electronics from Warsaw Technical University.


Shevek is an expert Java programmer, and one part of the creative team behind Karmasphere, a San Francisco based big data analytics company. He has worked on cutting edge research in compilers and language design, algorithmic optimization, systems and security. He received a Doctorate in Computing from the University of Bath, England. He also holds a Masters in Pure Mathematics and an epee.

Tim Anglade

Tim Anglade works for Cloudant: data wranglers, DB crafters & cloud provider extraordinaire. In his spare time, he organized the NOSQL Summer, filmed the NOSQL Tapes and talks databases at conferences & meetups around the world. He’s an interlocutor & skeptic for all things NOSQL and hates writing about himself in the third person.

Shay Banon

Founder of ElasticSearch, an Open Source, distributed, RESTful, Search Engine. Director of Technology at GigaSpaces Technologies, a leading provider of a new generation of application platforms for Java and .Net environments. Shay started working on mission critical real time C/C++ systems, later moving to Java (and never looked back). Within the Java world, Shay has worked on a propriety implementation of a distributed rule engine(RETE) server, your typical Java based web projects, and messaging based projects

Ryan Betts

Ryan Betts is a software engineer at VoltDB, Inc., where he has played a key role in the development of VoltDB’s core technologies including distributed transaction management, fault tolerance, cluster management and RESTful APIs. Previously, Ryan was a software engineer at IBM (via acquisition of DataPower) and Lucent Technologies (via acquisition of Ascend).

Nick Burch

ASF member

Nick is heavily involved in a number of Apache projects, such as Tika, POI and Chemistry. He works as a senior developer at Alfresco, a leading open source Enterprise Content Management vendor. He is a big fan of many aspects of agile and extreme development, and rapid web development frameworks such as Django. Nick is often to be found attending or organising BarCamps, Geek Nights, or other such fun events dedicated to sharing what's great and new!

Rod Cope

Rod Cope is the CTO and Founder of OpenLogic, a provider of Open Source support and governance solutions for the enterprise.  He has over 25 years of software development experience in a wide range of industries and technologies.

Prior to founding OpenLogic, Rod worked for General Electric, IBM, IBM Global Services, and Anthem before starting his own consulting company. As a consultant, he has architected solutions for Ericsson, Ford, Manugistics, Integral, Goodyear, and many other companies of all sizes.

He is currently writing a book on cloud computing with open source for Manning.

Sean Cribbs

Sean joined Basho Technologies in March 2010 after three years as a freelance web developer, in the course of which he worked on numerous startups and open-source projects, including the popular Radiant content management system which is built with Ruby on Rails. Prior to that, he studied Music Theory and Composition at the graduate level and directed a chorus after receiving bachelors degrees in Computer Science and Music from the University of Tulsa.

Jean-Daniel Cryans

Jean-Daniel is a Database Engineer at StumbleUpon. When he’s not developing HBase or supporting its usage inside the company, he’s helping others with the Hadoop stack. Jean-Daniel has been a commiter on the Apache HBase project since 2008.

Doug Cutting

Keynote Speaker

Doug is a founder of several Apache open source projects, including Lucene, Nutch, Hadoop and Avro. Doug currently works for Cloudera, and previously worked at Yahoo!, Apple, Excite and Xerox PARC. Doug holds a Bachelor’s degree from Stanford University and currently chairs the Board of the Apache Software Foundation.

Dale Harvey

Dale Harvey is an Engineer at Couchbase and a long time enthusiast of Erlang, Javascript and Open Source. Dale previously founded the online spreadsheet company Hypernumbers before moving to Couchbase to help build an awesome web application platform.

Devaraj Das

Senior Engineer at Yahoo! Inc., Sunyvale, California.

Devaraj Das is a senior engineer at Yahoo! Inc., Sunnyvale, California. He has extensive experience in the area of Apache Hadoop and related technologies. He is a Apache Hadoop committer and a member of the Apache Hadoop PMC. Devaraj has a Masters degree from Indian Institute of Science, Bangalore, India.

Ted Dunning

Keynote Speaker

Ted Dunning is the Chief Application Architect at MapR Technologies. Prior to MapR, he was Senior Technology Advisor at DeepDyve, chief architect behind the MusicMatch, (now Yahoo Music) management and recommendation system. Ted is actively contributing to several Apache open source projects, including Hadoop and Mahout.

Daniel Einspanjer

Metrics Engineering Manager

Daniel Einspanjer is a metrics engineering manager at Mozilla where he
oversees a global team of metrics engineers focused on analyzing data
and sharing insights about Firefox and other Mozilla products. Daniel
has been with the Mozilla project since March 2008 and joined as the
very first metrics engineer. He has been instrumental in designing and
building out Mozilla's large data systems. He also designed the open
source based ETL systems that collect and process large scale data from
Mozilla's global datacenters into a data warehouse and Hadoop clusters.

Eric Evans

Eric Evans is a Debian Developer and Apache Cassandra committer. To the disgust of many he once coined the term "NoSQL", but has since apologized and promises to do better. Eric resides in San Antonio Texas where he works on distributed systems for Rackspace.

Felix Geisendörfer

Felix Geisendörfer has been an early contributor to node.js, and has experienced it's strength and shortcomings first-hand while building When not event-looping, he loves unicycling and watching squirrels.

Christoph Goller

Christoph Goller has more than 10 years of experience in the search industry. He got a Ph.D in computer science from the Technical University of Munich where he worked in several research projects on artificial intelligence, machine learning and neural networks. Christoph started his career at Lernout & Hauspie. Since 2002 he is Director Research of Intrafind Software AG (, a German company specialized on full-text search and text mining based on Lucene and Solr. Christoph is Lucene committer since 2004.

Otis Gospodnetić

Otis Gospodnetić is a coauthor of Lucene in Action (1st and 2nd edition). He has been involved with Lucene since 2000 and Solr since 2006. He is also a member of Nutch, and Mahout development teams, as well as Lucene Project Management Committee. Otis is an Apache Software Foundation member and the founder of Sematext, a software development and consulting company focused on Search & Analytics using open-source technologies like Lucene, Solr, Nutch, Hadoop, HBase, Flume, and more.

Jonathan Gray

Jonathan Gray is a long-time Apache HBase committer and user of Apache Hadoop and other open source projects. As a software engineer and open source advocate at Facebook, Jonathan works across product teams on data infrastructure and the usage, contribution, and release of open source software.

Stefan Groschupf

CTO at datameer

Stefan Groschupf is the co-founder and CEO of Datameer and is well known for his engineering accomplishments in data management and large-scale distributed computing.

Before Datameer, Stefan was the co-founder and CEO of Scale Unlimited, a leading provider of educational and consulting services for Hadoop and related technologies with proven success in companies such as HP, Sun, Apple and Nokia. Earlier, Stefan was CEO of 101Tec, a supplier of Hadoop and Nutch-based search and text classification software to industry-leading companies such as DHL and EMI Music.


Have lunch together with other developers, speakers and users of projects presented at Berlin Buzzwords. Go out for dinner and drinks at Nollendorfplatz close to the venue. Exchange your experience with searching, storing and analysing data with others.

Jakob Homan

Engineer with the SNA team at LinkedIn working on Apache Hadoop, Project Voldemort and Kafka. Apache Hadoop Committer and PMC member. While at Yahoo!, member of five person team that added end-to-end security to Apache Hadoop.

Nuno Job

Nuno Job works for MarkLogic, a Silicon Valley startup where he helps shape the future of database systems.

Before MarkLogic Nuno worked in New York at IBM T.J. Watson Research Center in the DB2 pureXML team. He also worked in IBM Toronto where he had a broader role in the DB2 organization.

Database geek. Evangelist. Consultant. Lecturer. Nuno loves open-source technology – most of all being in the edge, and looking for innovative ways to solve problems.

Random facts: Likes photography, dinosaurs, and funny hats. And people that put commas before and.

Thomas Koch

Maintainer of Debian Hadoop packages

After having studied musical education and physics Thomas Koch didn't become a teacher but worked in different software companies for the last 5 years. His last project was a media monitoring service for a swiss startup.

Hannes Kruppa

At Nokia Berlin, Hannes is running a development team which contributes to Nokia's Maps Search Engine. The team is particularly focused on machine learning and data-driven methods and has close ties to Nokia Research. In a previous life Hannes completed a PhD on visual/image search topics, and parts of his work can still be found in Intel's OpenCV software library, along with numerous publications. More:

Sylvain Lebresne

Software Engineer for Datastax

Sylvain is an Apache Cassandra Committer and Software Engineer for Datastax where he works on the core Apache Cassandra database. Previously, he worked at Yakaz, a real-time classified ads web service, where he developed a storage infrastructure built on Cassandra. He has a Ph.D in Computer Science from the University of Paris 7.

Tim Lossen

Tim Lossen works as Ruby backend developer at wooga. He lives
in Berlin, Germany with his girlfriend and two small daughters.

Tim is in love with technology and has been described as a "user
group junkie" -- he can often be found at meetings of the local
Ruby, Python, Javascript or Lisp user groups. In his remaining
spare time he likes to hack on secret hardware projects down in
the basement.

Chris Male

Chris Male is a 25yr old New Zealander, working as a software developer for JTeam in New Zealand. He has been using Solr and Lucene for a number of years in many different projects. He has been heavily involved with the development of Lucene's spatial search support and is one of the authors of the popular Hunspell analysis framework for Lucene.

Mathias Meyer

Developer Advocate with Basho Technologies

Mathias Meyer is a Developer Advocate with Basho Technologies. He has worked in cloud infrastructure and automation, with distributed databases and systems, experiencing the joy of both success and failure with all of them. His interest in learning more in all these fields is unhealthy and has turned him to writing the NoSQL Handbook.

Mark Miller

Mark Miller is a Lucene/Solr committer, PMC member, and Apache Software Foundation member who has been dabbling with Lucene since 2006. He currently works for LucidImagination as an engineer/manager solving all kinds of fun programming problems.

Julien Nioche

Julien is the founder of DigitalPebble Ltd, a consultancy based in Bristol, UK. He specialises in Information Retrieval, Text Analysis, Information Extraction, Opinion Mining and Text Classification with a strong expertise on open source solutions. Julien is a committer on Apache Nutch, Tika and Gora and contributes to several other open source projects.

Steven Noels

CEO of Outerthought

Steven Noels is co-founder and CEO of Outerthought, makers of the open source Daisy CMS, the Kauri webapp framework and Lily, the first NoSQL-based content repository. Previously deeply infected by the angle brackets bug, he ventured from ancient SGML rites to hipsters' XML and then internet applications and onwards. Outerthought is a Belgian software technology provider on the crossroads of open source, REST, NoSQL and Java-based internet development.

Stanislaw Osinski

"Stanislaw Osinski is the co-founder at Carrot Search and committer to a number of open source projects, including: Carrot2, Solr Contrib, High Performance Primitive Collections for Java, JUnitBenchmarks. He received his MSc degrees from Poznan University of Technology, Poland and The University of Sheffield, United Kingdom. His research interests include document clustering, information retrieval, text mining, user interface design and usability."

Uwe Schindler

Uwe is committer and PMC member of Apache Lucene and Solr. His main focus is on development of Lucene Java. He implemented fast numerical search and is maintaining the new attribute-based text analysis API. He studied Physics at the University of Erlangen-Nuremberg and works as managing director for SD DataSolutions GmbH in Bremen, Germany, a company that provides consulting and support for Apache Lucene and Solr.

Frank Scholten

I am a Java developer at JTeam with 4 years of experience and a Mahout user/contributor. I have a Msc. in Computer Science from University of Twente, with a Software Engineering major. I have worked on e-commerce sites, web-based administrative systems and systems integration projects, mostly working with Spring, JPA/Hibernate and Wicket. Currently I am researching Mahout clustering in general and job configuration and composition in particular, see MAHOUT-612

Mathias Stearn

Software engineer at 10gen

Mathias Stearn is a Software Engineer for 10gen, where he works on the core MongoDB server and maintains the C language driver. Previously, he worked at FactSet where he used MongoDB in a log analysis application. He has a degree in Computer Science from the University of Maryland.

Daniel Trümper

Daniel is a Software developer currently working at Retresco GmbH where he is creating scalable applications for the web. Currently he is working on Spyder, a scalable spider using Python and ZeroMQ. Daniel holds a Diploma in Economics and Computer Science and has been working on distributed systems and NoSQL databases using Java and Python for several years.

Joseph Turian

Joseph Turian, Ph.D., heads MetaOptimize LLC, which consults
on data monetization, including predictive analytics, business
intelligence, data mining, NLP, ML. He also run the MetaOptimize Q&A
site (, where Machine Learning and Natural
Language Processing experts share their knowledge.

Joseph Turian specializes in data problems that are large scale, and
require well-engineered solutions. He has almost a decade of experience
in ML+NLP, and almost two decades of experience in coding.

Joseph Turian holds a Ph.D.

Siem Vaessen

Siem Vaessen is managing partner at Zimmerman & Zimmerman. He is an expert in traditional and new media, both on the level of concept, design and implementation. He started his career at the Dutch press agency ABC Press in 1993 where he was responsible for the hardcopy international photo archive which consisted of the Magnum Photography collection anb the Corbis collection. In 1997 he was hired to manage the Dutch division of UPI and was co-responsible for the continious development of the digital photo archive through which UPI started delivering news imagery via the Internet.

Friso van Vollenhoven

Friso is Xebia's principal in the Netherlands on all things NoSQL, focussing on Hadoop and HBase for handling of substantial amounts of data. Friso has a history of dealing with architecture to achieve sufficiently scalable, performant and above all working software. He has more than ten years behind the keyboard to tell and educate about.

Recent buzz: Hadoop, HBase, nodeJS, HTML5.

Alvaro Videla

Alvaro works for Liip AG in Switzerland as Software Developer. Before moving to Europe he used to work in Shanghai were he helped building one of Germany biggest dating websites. He's co-authoring the book "RabbitMQ in Action" for Manning Publishing. Some of his open source projects can be found here: Apart from code related stuff he likes traveling with his wife, listening/playing music and reading books.

Lukáš Vlček

Lukáš Vlček started his career in 2000 as a graphic designer and web developer but was eventually grabbed by Java and server side coding. He spent many years developing applications for world wide organisations, but after all, his growing passion about Java open source brought him to JBoss in 2009 and his second passion for full text search led him to joining JBoss community team in June 2010. His responsibility (calling would be a better word) is to improve full text search for the web site and make it really rock!

Kai Voigt

Kai studied computer science in Kiel, his diploma thesis covered a HTTP session framework for web based applications.

For over 5 year, he worked for MySQL as an instructor and consultant, also covering MySQL Cluster, a distributed high available database.

Currently, he's an instructor at Cloudera, the major software and service supporting Hadoop and related projects.

Michael Wallace

Michael is a research assistant at Bristol University with a background in telecoms engineering and seven years of experience developing software for a range of applications including transport, medical physics and geographical sciences. Primarily using Python and Java, his past projects include a prototype in-vehicle information display and data capture system, and a web-based application for calculating and storing diode measurements for radiotherapy treatment QA.

Jim Webber

Chief Scientist with Neo Technology

Dr. Jim Webber is Chief Scientist with Neo Technology the company behind the popular open source graph database Neo4j, where he works on graph database server technology and writes open source software. Jim is interested in using big graphs like the Web for building distributed systems, which led him to being a co-author on the book REST in Practice, having previously written Developing Enterprise Web Services - An Architect's Guide. Jim is an active speaker, presenting regularly around the world. His blog is located at and he tweets often @jimwebber.

Dawid Weiss

Dawid Weiss shares academic and industrial background: he is an associate professor at the Institute of Computing Science of Poznan University of Technology in Poland (PhD in Information Retrieval) and co-owns Carrot Search, a company that provides commercial services revolving around text processing, text mining and text clustering. In his spare time Dawid contributes to several open source projects, including, reads books and passionately plays basketball with a bunch of his old friends. He lives in Poznan, Poland with his wife and two children.

Chris Wensel

Chris K Wensel is the founder of Concurrent, Inc., and the author of the Cascading data processing open-source project, an alternative API to MapReduce for Apache Hadoop.

He also co-founded Scale Unlimited, the first Hadoop and "Big Data" related professional services and training company, where he mentored and trained companies like Sun Microsystems, Apple, and numerous startups in the Bay Area.

Chris bootstrapped his first Internet startup in the early 90's, creating an early Web server-side scripting language used in the real estate and insurance verticals.

Matthias Wessendorf

Matthias is a senior software developer at Kaazing, where he is focusing on HTML5, WebSocket and related technologies to enable the "next generation web".

Simon Willnauer

Committer Lucene Core/ PMC member

Simon is a Lucene core committer and PMC member. During the last couple of years he worked on design and implementation of scalable software systems and search infrastructure. He studied Computer Science at the University of Applied Sciene Berlin. Currently, he work as a consultant for Apache Solr, Lucene Java and Hadoop and is a co-organizer of the "BerlinBuzzwords" conference on Scalability June 2010 / 2011 in Berlin.