infofarm.be
MLlib | InfoFarm
http://www.infofarm.be/tools/mllib
Skip to main content. MLlib is a machine learning library related to Spark. It can be used with Java, Scala or Python and it runs on existing Hadoop clusters and data. Spark excels at iterative computation, enabling MLlib to run fast: up to a 100 times faster than MapReduce. MLlib contains high-quality algorithms that leverage iteration, and can yield better results than the one-pass approximations sometimes used on MapReduce. Seminar Machine Learning in Practice available on Slideshare. Tools (Mahout, R,.
infofarm.be
R | InfoFarm
http://www.infofarm.be/tools/r
Skip to main content. R is developed as a statistical software, hence off course it is scoring very high on statistics. We are not pretending that all statistical formulas can be found in one of the many packages, but it will support almost every statistical analysis you can think of. Computer science and machine learning. R has many ways to support your communication. For example, documents can be written in R markdown. In order to make presentations you can use a wide variety: you can make beam...R als...
infofarm.be
Pig | InfoFarm
http://www.infofarm.be/tools/pig
Skip to main content. Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallellization, which in turns enables them to handle very large data sets. Extensibility: users can create their own functions to do special-purpose processing. Easy recommendations on Hadoop. People ofte...
infofarm.be
Spark | InfoFarm
http://www.infofarm.be/tools/spark
Skip to main content. Spark can run on Hadoop 2's YARN and can read any existing Hadoop data. It is developed to run programs faster by making more use of in-memory data processing. Spark developers claim that it runs 100 times faster than Hadoop MapReduce in memory or 10 times faster on disk. Spark has its own machine learning library, namely MLlib. It can be called upon using Java, Scala or Python. All 'LOF' for the team. But first things first! They have more and. Where are the students? 32 3 871 99 66.
infofarm.be
InfoFarm
http://www.infofarm.be/tools
Skip to main content. Hadoop and its filesystem HDFS is open-source software (part of Apache) for distributed processing of Big Data. R is the go-to tool of our Data Scientists. It's great for exploratory analysis and allows easy access to statistical, mathematical and machine learning functions. Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. 32 3 871 99 66.
infofarm.be
Hadoop | InfoFarm
http://www.infofarm.be/tools/hadoop
Skip to main content. Hadoop and its filesystem HDFS is open-source software (part of Apache) for distributed processing of Big Data. It allows you to set up a cluster of computers and reliably store your data. It has a master-slave structure, where the master is called the "namenode" and that knows where and how the data is stored in large blocks (64Mb or 128Mb). There is redundancy to prevent loss of data. So basicly it is a way to store and read your data. Easy recommendations on Hadoop. 32 3 871 99 66.
infofarm.be
InfoFarm
http://www.infofarm.be/solutions
Skip to main content. Want some of our knowledge on Big Data and Data Science solutions transferred to yourself and/or your colleagues? We have some courses and training materials at hand, and can teach you what we know best. New to Big Data? Want to learn about Data Science? Eager to learn what Hadoop is all about? InfoFarm can organise a small workshop at your company to tell you and your colleagues all about it! In half a day you learn about all the concepts of the domain of your choice. 32 3 871 99 66.
infofarm.be
Students left... quite an impression! | InfoFarm
http://www.infofarm.be/articles/students-left-quite-impression
Skip to main content. Students left. quite an impression! Giving points to all tasks, such a strict alignement, ." But when she said it, she admitted that it helped quite a lot! Even an administrator view was finished during the last sprint. At InfoFarm we did put this in the backlog, thinking they would never be able to finish it. But then they would learn how to ask to skip some work. Unfortunately. No luckily they did not have to learn this! And the application itself? 32 3 871 99 66.
infofarm.be
InfoFarm
http://www.infofarm.be//articles
Skip to main content. The alternating least squares algorithm in recommenderlab. Many sites nowadays use recommendation engines in order to offer additional value to the client and increase revenue. This technology is used for a wide variety of subjects, ranging from movies, music, books, news and restaurants to candidates on online dating sites. In this post we go into detail of a new alternating least squares implementation we created and put out in the open as open source code. What do they think?