The Wizard of Oz, Smart City version!

Miguel Izquierdo Business, Data Science, Technology Leave a Comment

Once upon a time, a girl named Dorothy lived in the state of Oz, in a smart city named Emerald City. A huge tornado struck this city, but fortunately, Dorothy and all the other citizens had been evacuated well before the tornado reached them. In this city, people used to find their way by following a yellow brick road, but sometime after, …

Codemotion 2017: A Display of Talent and Charisma

Miguel Izquierdo Big Data Architecture, Business, Technology 1 Comment

This year we attended Codemotion for the first time, and as invited guests at that. We knew it was a big event among developers, but we did not fully grasp its scale until we saw the huge number of participants, the large rooms packed with people, and the madness of having to arrive early if you wanted to find …

Smart City Expo World Congress 2017 – A window to the cities of the future

Miguel Izquierdo Business Leave a Comment

Last week the Smart City Expo World Congress 2017 took place, and following the publication of our paper on Public Transport Optimization, Datatons was invited to attend the event as speakers. We were thrilled to hear the news and time flew between the notification and November 14th! So the date arrived, we packed our things, unfolded our best presentation cards …

Hierarchical Clustering of Twitter Followers

Manuel Lamelas Data Science, Technology 2 Comments

Hi again!!! In this new post we are going to explore an impressive way of clustering Twitter followers, using the Datatons account as an example. We will try to segment our followers into different groups and see what they have in common. For this we are going to use Hierarchical Clustering in Python. If you want to see the complete code …
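
To give a rough idea of what hierarchical (agglomerative) clustering does, here is a minimal standard-library sketch. It is not the code from the post: the follower features, the `agglomerative` helper and the choice of average linkage are all assumptions made for illustration.

```python
from itertools import combinations
from math import dist


def agglomerative(points, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of point indices."""
    # Start with every point in its own cluster.
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Average pairwise Euclidean distance between two clusters.
        return sum(dist(points[i], points[j]) for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge them.
        a, b = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(a + b)
    return [sorted(c) for c in clusters]


# Hypothetical follower features: (tweets per day, share of tweets about data topics).
followers = [(0.5, 0.9), (0.7, 0.8), (6.0, 0.1), (5.5, 0.2), (0.6, 0.85)]
groups = agglomerative(followers, n_clusters=2)
print(sorted(groups))
```

Each pass merges the two closest groups, so stopping at different cluster counts gives coarser or finer segments of the same followers.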

eShow 2017: Big Data at the Court of E-Commerce

Miguel Izquierdo Business Leave a Comment

A few days ago eShow took place at IFEMA, and we did not want to wait any longer to publish our article about the event! First of all, for those of you who have not heard of it before, eShow is an e-commerce trade fair held in Barcelona and Madrid during the second and fourth quarters, respectively. Any event …

Google Developer Days Europe: A pleasant surprise

Manuel Lamelas Business Leave a Comment

“Too much developer stuff!”, I remember telling Inés prior to the event. Not much of an excuse, but I guess I wasn’t really trying to convince anyone and, thankfully, my colleague not only didn’t share my grumpy mood, but her usual cheerful attitude easily helped me overcome my reluctance. So this is how our trip to Krakow began, with low expectations …

Big Data Is the New Black

Datatons Business Leave a Comment

Everyone is talking about Big Data: whether it is one of the most disruptive technologies of recent years, the potential it has, the privacy problems arising from the new capabilities for storing data… It is true that little by little, although in growing numbers, companies are setting out on the data adventure with …

Data Logistics with Apache Nifi

Manuel Lamelas Big Data Architecture, Technology Leave a Comment

As announced in a previous post, we’re now going to introduce you to Apache Nifi, the latest trend in ingestion tools: a new project from the Apache Software Foundation that allows you to manage data flows with a cool graphical interface. If we haven’t caught your attention yet, wait until you hear this: the NSA created it!!! Nifi – the UPS of data …

Kerberos & Hadoop: Securing Big Data (part I)

Celeste Duran Big Data Architecture, Technology 1 Comment

When I began to use Hadoop with Kerberos I felt as if I were in the middle of the ocean. I found a lot of information about Kerberos technology, but it was very difficult for me to find anything about how to use it on Hadoop, why to use it and how to configure it to work with Hadoop. This trilogy of posts is going to …

Morphlines – Hadoop ETL by Cloudera

Manuel Lamelas Big Data Architecture Leave a Comment

Today we are going to talk about Morphlines, an open source framework developed by Cloudera that provides a new way to do ETL on Hadoop. What are these morphlines? Morphlines are simple configuration files that define how to transform data on the fly. Each one consists of a file that describes the steps a data flow has to pass through in order to …