stuartsierra.com
A Million Little Files – Digital Digressions by Stuart Sierra
https://stuartsierra.com/2008/04/24/a-million-little-files
Digital Digressions by Stuart Sierra. From programming to everything else. A Million Little Files. Here’s some code (Apache license), including all the Apache jars needed to make it work:. Unpack it and run:. Java -jar tar-to-seq.jar. The output sequence file is BLOCK-compressed, about 1.4 times the size of a bzip2-compressed tar file. Each key is the name of a file (a Hadoop “Text”), the value is the binary contents of the file (a BytesWritable). April 24, 2008. April 24, 2008. Do hadoop offer some?
shmsoft.blogspot.com
SHMsoft blog: October 2014
http://shmsoft.blogspot.com/2014_10_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Monday, October 27, 2014. Big Data Cartoon: Big Data needs big muscle. Inspired possibly by this cartoon in New Yorker. Our illustrator has set out to tell us that being in Big Data, you travel a lot, and of course avail yourself of the exercise facilities found in each and every hotel. My latest was a packed gym in downtown San Francisco. Lately, I've been noticing that trainers at Elephant Scale have been gaining muscle weight. Links to this post.
ivanlife.wordpress.com
¡Atento! | Ivan's blog
https://ivanlife.wordpress.com/¡atento
Erasmus, Inglaterra, Hertfordshire, mis aficiones…. Te gustaría enterarte de las últimas actualizaciones del blog? Pues échale vistazo a este post. Que escribí con ese mismo propósito. Me encantan las fotos, en cada una de ellas expresa, lo qu en determinados momentos uno siente, y con una imagen de naturaleza sobran las palabras, muestras en todo momento lo que sientes…. muy buenas fotos. Muchas gracias por tus palabras Danna! Deja una respuesta Cancelar respuesta. Introduce aquí tu comentario.
shmsoft.blogspot.com
SHMsoft blog: Big Data Cartoons - Summer of Big Data
http://shmsoft.blogspot.com/2015/07/big-data-cartoons-summer-of-big-data.html
Hadoop, Big Data, Spark - and some eDiscovery. Friday, July 3, 2015. Big Data Cartoons - Summer of Big Data. Since nothing much happens in Big Data in the summer (JK:), our artist took to making pictures of the breakfasts that an artist needs. Here are some examples. Once this page is visited by more than a million people, it itself will qualify for a "Big Data" page. Subscribe to: Post Comments (Atom). FreeEed - Open source eDiscovery. Review of “Monitoring Hadoop” by Gurmukh Singh.
shmsoft.blogspot.com
SHMsoft blog: September 2014
http://shmsoft.blogspot.com/2014_09_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Tuesday, September 30, 2014. Got an Ubuntu laptop! Quite powerful and good-looking, from System76. (It is the one in the middle). Now I have a chance to be productive while traveling or working in friends' place. I am planning to add Windows in a VM, stay tuned. Links to this post. Sunday, September 7, 2014. Big Data Cartoon: NY is new Silicon Valley. Silicon Valley, pay attention! Links to this post. Subscribe to: Posts (Atom). Got an Ubuntu laptop!
shmsoft.blogspot.com
SHMsoft blog: July 2014
http://shmsoft.blogspot.com/2014_07_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Thursday, July 24, 2014. FreeEed does Concordance (R). The latest release of FreeEed (V4.4) allows import into Concordance (R) eDiscovery management software. Here. It also contains a number of fixes. You can use FreeEed in so many ways:. Start a FreeEed server on Amazon, no hardware needed;. Download a virtual machine to your workstations;. Install in Windows, Linux, or Mac. And all of the popcorn advantages. Links to this post. Wednesday, July 2, 2014.
shmsoft.blogspot.com
SHMsoft blog: February 2015
http://shmsoft.blogspot.com/2015_02_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Tuesday, February 24, 2015. Big Data Cartoon - What's with Pivotal? Last week, Pivotal joined its forces with its former rival, Hortonworks. Announcing that they will form a join Hadoop Core platform. In my understanding, Pivotal is giving up its own distribution of Hadoop in favor of Hortonworks Data Platform. However, Yevgeniy Sverglik on DataCentralKnowledge. Links to this post. Monday, February 23, 2015. FreeEed technologies led to DARPA project. But the...
shmsoft.blogspot.com
SHMsoft blog: Joe Witt of Onyara presented Apache NiFi
http://shmsoft.blogspot.com/2015/06/joe-witt-of-onyara-presented-apache-nifi.html
Hadoop, Big Data, Spark - and some eDiscovery. Wednesday, June 10, 2015. Joe Witt of Onyara presented Apache NiFi. Joe Witt and the team of Onyara came to present Apache Nifi at Houston Hadoop Meetup. The NiFi project is the result of eight years of development at NSA, which has been open sourced in November of 2014. The project is for automating enterprise dataflows, and its salient use cases are. Data Processing (enrichment, filtering, sanitization). For the rest, in the words of Shakespeare.
shmsoft.blogspot.com
SHMsoft blog: July 2015
http://shmsoft.blogspot.com/2015_07_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Wednesday, July 22, 2015. Review of “Monitoring Hadoop” by Gurmukh Singh. Is recently published, April 2015, and it covers Nagios, Ganglia, Hadoop monitoring and monitoring best practices. The same goes for Ganglia: it is covered in sufficient detail for one to be able to install and run, with enough attention to Hadoop specifics. What I did not find in the book, and what could be useful. to read further. Links to this post. Links to this post. By itself, GA...
shmsoft.blogspot.com
SHMsoft blog: November 2014
http://shmsoft.blogspot.com/2014_11_01_archive.html
Hadoop, Big Data, Spark - and some eDiscovery. Wednesday, November 26, 2014. Big Data Cartoon - What is text analytics? Analytics may be the next big thing in Big Data, but it is very hard to define what it really is. Firstly, this word shows as misspelled in the browser and in Word or OpenOffice. Secondly, it's too vague and nebulous. As always, when in doubt, we turn to our illustrator, and our RK can illuminate us with a simple to understand cartoon that even data scientists can get. Links to this post.