datacommunitydc.org
Blog — Data Community DC
http://www.datacommunitydc.org/blog?category=Announcements
January 23, 2017. June 20, 2016. Announcements, Community, Events. November 30, 2015. Commentary, Data Science DC, Events, Reviews. Socially Responsible Algorithms at Data Science . Troubling instances of the mosaic effect in which different anonymized datasets are combined to reveal unintended details include the tracking of celebrity cab trips. And the identification of Netflix user profiles. August 1, 2015. Data Science DC's hottest topic is: Hulls? July 27, 2015. June 20, 2016. June 20, 2016. Aaron S...
datacommunitydc.org
Sponsor Us — Data Community DC
http://www.datacommunitydc.org/sponsor-us
PROMOTE YOUR BRAND, CONFERENCE, OR CLASSES. Organizational Sponsorship includes a Full Slide and Announcement at every DC2 Program event, Social Media promotion, an opportunity to post on our blog, and job advertisements in our Newsletter. Our Program events are a subset of our overall Meetup network. Download our prospectus to learn more about our programs. Explore our Network of Talent and Opportunity. And helps you with cool visualizations. Promote your classes to an actively educated audience. A/V Se...
districtdatalabs.silvrback.com
District Data Labs
http://districtdatalabs.silvrback.com/tags/nlp
Hands-on data science tutorials, lessons, and other awesome content. NLP Research Lab Part 2: Skip-Gram Architecture Overview. Editors Note: This post is part of a series based on the research conducted in District Data Labs NLP Research Lab. Make sure to check out NLP Research Lab Part 1: Distributed Representations. Chances are, if you’ve been working in Natural Language Processing (NLP) or machine learning, you’ve heard of the class of approaches called . . . Posted in: machine learning. July 27, 2016.
districtdatalabs.silvrback.com
District Data Labs
http://districtdatalabs.silvrback.com/tags/data%20products
Hands-on data science tutorials, lessons, and other awesome content. The Age of the Data Product. We are living through an information revolution. Like any economic revolution, it has had a transformative effect on society, academia, and business. The present revolution, driven as it is by networked communication systems and the Internet, is unique in that it has created a surplus of a valuable new material - data - and transformed us all . . . Posted in: data products. May 20, 2015.
districtdatalabs.silvrback.com
District Data Labs - Getting Started with Spark (in Python)
http://districtdatalabs.silvrback.com/getting-started-with-spark-in-python
Getting Started with Spark (in Python). Which is implemented as HDFS in Hadoop, and a framework for distributed computing ( MapReduce. These two ideas have been the prime drivers for the advent of scaling analytics, large scale machine learning, and other big data appliances for the last ten years! To address these problems, Hadoop has been moving to a more general resource management framework for computation, YARN. Yet Another Resource Negotiator). YARN implements the next generation of MapReduce, ...
districtdatalabs.silvrback.com
District Data Labs
http://districtdatalabs.silvrback.com/tags/spark
Hands-on data science tutorials, lessons, and other awesome content. Getting Started with Spark (in Python). Is the standard tool for distributed computing across really large data sets and is the reason why you see Big Data on advertisements as you walk through the airport. It has become an operating system for Big Data, providing a rich ecosystem of tools and techniques that allow you to use a large cluster of relatively cheap . . . February 02, 2015. You can also find District Data Labs.
districtdatalabs.silvrback.com
District Data Labs
http://districtdatalabs.silvrback.com/tags/probability
Hands-on data science tutorials, lessons, and other awesome content. Conditional Probability with R. Likelihood, Independence, and Bayes. In addition to regular probability. We often want to figure out how probability is affected by observing some event. For example, the NFL. Season is rife with possibilities. From the beginning of each season, fans start trying to figure out how likely it is that their favorite team will make the playoffs. After every game the team plays, these . . . October 23, 2014.
districtdatalabs.silvrback.com
District Data Labs - What Are the Odds?
http://districtdatalabs.silvrback.com/intro-to-probability-with-r
What Are the Odds? An Intro to Probability with R. Broadly speaking, we can arrive at probabilities for events in two ways:. Probabilities are assigned based on some physical and repeatedly observable state of affairs. Most commonly, we use. Suppose we observe a random event (say, getting heads) as the result of an experiment (flipping a coin). The probability of getting heads is determined by the relative frequency of seeing heads when the experiment is repeated a number of times. Cum prob ,. Fortunatel...
districtdatalabs.silvrback.com
District Data Labs
http://districtdatalabs.silvrback.com/archive
Hands-on data science tutorials, lessons, and other awesome content. Order By → Newest First. Principal Component Analysis with Python. NLP Research Lab Part 2: Skip-Gram Architecture Overview. NLP Research Lab Part 1: Distributed Representations. Beyond the Word Cloud. District Data Labs PyCon Recap. Visual Diagnostics for More Informed Machine Learning: Part 3. Preparing for NLP with NLTK and Gensim. Visual Diagnostics for More Informed Machine Learning: Part 2. Building a Classifier from Census Data.