girlincomputerscience.blogspot.com
Girl in the World of Computer Science: February 2015
http://girlincomputerscience.blogspot.com/2015_02_01_archive.html
Girl in the World of Computer Science. Wednesday, February 11, 2015. Many Machine Learning articles and papers describe the wonders of the Support Vector Machine. SVM) algorithm. Nevertheless, when using it on real data trying to obtain a high accuracy classification, I stumbled upon several issues. I will try to describe the steps I took to make the algorithm work in practice. This model was implemented using R. And the library "e1071". To install and use it type:. Few algorithms are better than SVM.
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: Dealing with NP-Hard Problems: An Introduction to Approximation Algorithms
http://girlincomputerscience.blogspot.com/2014/01/dealing-with-np-hard-problems.html
Girl in the World of Computer Science. Friday, January 31, 2014. Dealing with NP-Hard Problems: An Introduction to Approximation Algorithms. This is just a quick overview on approximation algorithms. It is a broad topic to discuss. For more info rmation. The famous NP-Complete class is known for its possible intractability. NP means non deterministic polynomial. And for a problem to be NP-Complete it has to be. NP (verified in polynomial time) and. NP-Hard (as hard as any other problem in the NP class).
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: June 2013
http://girlincomputerscience.blogspot.com/2013_06_01_archive.html
Girl in the World of Computer Science. Tuesday, June 18, 2013. Apache Hive .orig test file and "# # A masked pattern was here # # ". Just a quick information about something in Hive. If you ever typed:. Ant clean package test. To run Apache Hive. Unit tests, you may have seen that Hive sometimes creates two output files. If you run for example:. Ant test -Dtestcase=TestCliDriver -Dqfile=alter5.q. Hive sometimes generates a alter5.q.out and a alter5.q.out.orig :. A masked pattern was here # #. Apache Hive...
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: Overview of Digital Cloning
http://girlincomputerscience.blogspot.com/2013/12/overview-of-digital-cloning.html
Girl in the World of Computer Science. Friday, December 20, 2013. Overview of Digital Cloning. The growth of the image processing and editing software availability has made it easy to manipulate digital images. With the amount of digital content being generated nowadays, developing techniques to verify the authenticity and integrity of digital content might be essential to provide truthful evidences in a forensics case. 1] divide an image in blocks of pixels and compare them to find a forgery. Once block...
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: August 2014
http://girlincomputerscience.blogspot.com/2014_08_01_archive.html
Girl in the World of Computer Science. Tuesday, August 5, 2014. Lecture on Recommender Systems. Great lecture on Recommender Systems by Xavier Amatriain, Researcher on Netflix. Https:/ www.youtube.com/watch? Https:/ www.youtube.com/watch? Posted by Renata Ghisloti Duarte de Souza Granha. Subscribe to: Posts (Atom). Renata Ghisloti Duarte de Souza Granha. View my complete profile. Lecture on Recommender Systems. There was an error in this gadget. Data Science with R. Mining of Massive Datasets.
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: March 2013
http://girlincomputerscience.blogspot.com/2013_03_01_archive.html
Girl in the World of Computer Science. Thursday, March 28, 2013. IBM BigData approach: BigInsights. Hadoop and BigData have been two tremendous hot topic lately. Although many people want to dig into Hadoop and enjoy the benefits of Big Data, most of them don't know exactly how to do it or where to start it. This is where BigInsights. BigInsights is the Apache Hadoop. Related software from IBM. And its many built-in features and capabilities leverage your start point. You can find it here:. Posted by Ren...
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: Removing Outliers to Plot Data
http://girlincomputerscience.blogspot.com/2015/07/removing-outliers-to-plot-data.html
Girl in the World of Computer Science. Friday, July 31, 2015. Removing Outliers to Plot Data. I am currently working a lot with R. One simple thing that helps me to better visualize data is to plot it excluding outliers. To do so, first read the data. Data = read.table(“myfile.txt”). Then, you can check how data is distributed. Quantile(data, c(.02, .05, .10, .50, .90, .95, .98). An example output would be. 2% 5% 10% 50% 90% 95% 98%. 189 190 190 194 241 275 316. X - quantile(data, c(.01, .99).
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: Understanding Apache Hive
http://girlincomputerscience.blogspot.com/2013/08/understanding-apache-hive.html
Girl in the World of Computer Science. Tuesday, August 20, 2013. Apache Hive is a software application created to facilitate data analyses on Apache Hadoop. It is a Java framework that helps extracting knowledge from data placed on a HDFS cluster by providing a SQL-like interface to it. The Apache Hadoop platform is a major project on distributed computing and it is commonly assumed to be the best approach when dealing with BigData challenges. Hadoop facilitates large data processing, but it still requir...
girlincomputerscience.blogspot.com
Girl in the World of Computer Science: May 2013
http://girlincomputerscience.blogspot.com/2013_05_01_archive.html
Girl in the World of Computer Science. Tuesday, May 21, 2013. BigData Free Course Online. Offers several great online courses from the best universities around the world. The courses involve video lectures being released weekly, work assignments for the student, and reading material indications. I had enrolled on this course about BigData. A couple of months ago, and I confess I didn't have time to start doing it since last week. Once I started the course I was pleased with the content presented.