Features (7) | Software (4) | Courses, Events (2) | Webcasts (1) | Meetings (2) | Jobs (6) | Academic (2) | Competitions (2) | Publications (8) | NewsBriefs (6) | CFP (11) | Quote
Features
- New KDnuggets Poll: Highest Education Level? - Aug 27, 2012.
The new KDnuggets Poll asks: What is your highest education level? Please vote.
- Poll Results: Data types analyzed/mined in the past 12 months - Aug 26, 2012.
Tabular data remains most popular (analyzed by over 70% of data miners), but text, XML, and social network data grow in popularity in 2012.
- Strata Big Data Conferences in London (Oct 1-2) and New York (Oct 23-25) - a free pass - Aug 16, 2012.
Join the best minds in data to explore the latest in the data revolution: trends, tools, new practices, careers & culture, and the discussion around ethics, policy, and privacy. KDnuggets is giving away one free pass to each conference - see details.
- PAW Boston: meet the Top Predictive Analytics Influencers - Aug 27, 2012.
Meet the top predictive analytics influencers (including KDnuggets editor) who will gather at Predictive Analytics World Boston, Sep 30 - Oct 4, 2012. Special KDnuggets discount!
- Data Analysis Conference 2-for-1 registration - Aug 24, 2012.
Data Analysis Conference: Tools of the Trade will bring together data analysts and researchers from academe, industry and government to examine the depth and breadth of modern analytic software. Gregory Piatetsky-Shapiro will deliver the keynote on "Data Mining, Predictive Analytics and Big Data: The reality and the hype".
- Top news for Aug 19-25: BigData Choice: Which database to use? Stanford Big Data Mining Online - Aug 26, 2012.
Interesting Projects in Machine Learning, Data Mining; Stanford Online; BigData Choice: Which database to use?
Top jobs: Lead Data Scientist / IT Specialist at CFPB (Consumer Financial Protection Bureau); Data Scientist - Analytics at LinkedIn
- Top news for Aug 12-18: Big Data Cartoon; Interesting Open-source ML/Data Mining projects - Aug 19, 2012.
Big Data Cartoon: what to do with 100,000 Warehouses?; Interesting Open-Source Projects in Machine Learning, Data Mining, Data Science;
Top jobs: Algorithms Scientist at Guardian Analytics; Statistical Scientist - Predictive Modeling at Experian R&D Data Lab.
Software
- BigData Choice: Which database to use? - Aug 21, 2012.
In the era of big data, RDBMS is no longer the only choice. Here is a guide for what type of DB to choose, including Key-value pair, Column family/big table, Document, and Graph databases.
- Apache Drill clones Google Dremel Real Time Big Data Tool - Aug 21, 2012.
The Apache Drill project wants to build an open source version of Google Dremel, which allows real-time querying of Big Data and is suitable for processing streaming data. This will be a big step beyond Hadoop, which was designed for batch processing.
- Google Dremel makes Big Data interactive, goes way beyond Hadoop - Aug 16, 2012.
Google Dremel is a scalable, interactive ad-hoc query system for Big Data which greatly exceeds Hadoop capabilities. By combining multi-level execution trees and columnar data layout, it can aggregate trillion-row tables in seconds. You can use Dremel today.
- 11Ants Analytics Predictor for Oracle - Aug 24, 2012.
11Ants Predictor for Oracle is a high-speed scoring engine for deploying predictive models to Oracle enterprise databases. It enables users to easily deploy predictive models built with 11Ants Analytics desktop modeling tools.
Courses, Events
Webcasts
Meetings
Jobs
- Analytics Consultant/Strategist at Aviana, San Francisco Bay Area, Los Angeles, Orange County, San Diego, Denver, Atlanta - Aug 27, 2012.
Critical for our success are consultants with astute business acumen, who can speak the language of business and connect the various components of our BA offerings with concrete business outcomes. Hiring at 3 levels: experienced consultants, mid-level, and junior consultants.
- Sr. Customer Experience Analyst at Medical Mutual, Cleveland, OH - Aug 24, 2012.
Supports implementation and continuous improvement of the Customer Experience team analytics program; delivers analytic insights by conducting in-depth quantitative analysis supporting strategic business decisions.
- Business Analytics and Optimization Consultant at IBM, Columbus, OH - Aug 23, 2012.
Develop innovative solutions to solve complex business and technical issues across 17 industries, as part of IBM integrated consulting services. Make a difference for top-tier global businesses and public sector clients.
- Software Development Engineer-Search Analytics at A9, Palo Alto, CA - Aug 20, 2012.
Design, implement, and maintain large-scale distributed systems to process immense volumes of real-world data from Amazon.com and make it available for programmatic and human consumption.
- Algorithms Scientist (PhD) at Guardian Analytics, Mountain View, CA - Aug 17, 2012.
You are an accomplished mathematician/statistician/computer scientist able to develop, analyze and program cutting-edge algorithms for fraud prevention applications.
- Statistical Scientist - Predictive Modeling at Experian R&D Data Lab, San Diego, CA - Aug 17, 2012.
Join Experian CSDA (Credit Services and Decision Analytics) R&D Data Lab concentrating on new data asset evaluation and acquisition as well as new product prototyping.
Academic/Research positions
- Faculty (Tenure Track) at U. of Iowa, Tippie College of Business, Iowa City, IA - Aug 21, 2012.
Recruiting for a tenure track faculty position starting fall 2013 in analytics, optimization, machine learning, and statistics or related areas.
- Postdoc in Machine Learning, Data Mining for Bioinformatics Applications at Hellas Foundation for Research and Technology, Heraklion, Crete, Greece - Aug 23, 2012.
Bioinformatics Labs is looking for energetic and intelligent researchers to lead the development of novel causal discovery and machine learning methods for bioinformatics and computational biology problems.
Competitions
Publications
- Practical Text Mining Book Chapter: 7 Practice Areas - free download - Aug 27, 2012.
This chapter organizes text analytics methods as seven complementary practice areas, showing how to select amongst them for your objectives.
- Mining of Massive Datasets Book - revised, free to download - Aug 16, 2012.
This excellent book by top Stanford researchers covers Data Mining, Map-Reduce, Finding similar items, Mining Data Streams, and much more. It was revised and published, but a version is still free to download.
- KDnuggets Twitter connection map in NodeXL - Aug 27, 2012.
This interesting map shows a directed network, clustered by keywords, of Twitter users whose recent tweets contained kdnuggets.
- US Government DNI Data Mining Report - Aug 20, 2012.
This unclassified report on 2011 data mining activities was prepared by the Office of the Director of National Intelligence (ODNI), as requested by the Congressional Data Mining Reporting Act.
- Top KDnuggets tweets, Aug 23-26: Stanford: Data Mining and Statistics Courses Online; Hack/reduce, Boston-area, big-data facility - Aug 27, 2012.
Stanford: Data Mining and Statistics Courses Online; Hack/reduce, Boston-area, big-data facility plans to produce 1000 #BigData experts; Bigger than #BigData: Facebook processes 2.5 B content items, 2.7B Likes; Hot topic! Streaming Data Mining Tutorial slides from KDD-2012
- Top KDnuggets tweets, Aug 20-22: How TripAdvisor succeeded; Must read for new Data Scientists: Getting started with R/Hadoop - Aug 23, 2012.
How TripAdvisor succeeded - powerful network effects, an amazing business model; Must read for new Data Scientists: Getting started with R and Hadoop; Eight Principles of Data Visualization; BigData Machine Learning and Predictive Analytics Cheat Sheet
- Top KDnuggets tweets, Aug 16-19: Cookbook for R: solutions to common tasks and problems; Mining of Massive Datasets Book, free to download - Aug 20, 2012.
Cookbook for R: solutions to common tasks and problems; Mining of Massive Datasets Book, by top Stanford researchers, free download; Costliest Lesson? Guess which company lost $45B in market cap; Character social networks in movies - cool, useful viz
- Top KDnuggets tweets, Aug 13-15: Interesting Open-Source Projects in ML, Data Science; Scalable Machine Learning course at Berkeley - Aug 16, 2012.
Interesting Open-Source Projects in Machine Learning, Data Mining, Data Science; SML: Scalable Machine Learning course at Berkeley, lectures, more; Interviewing Data Scientists: 5 core skills; Why Data Science is so strong in India
News Briefs
- Romney uses secretive data-mining - Aug 27, 2012.
The Romney campaign began a secretive data-mining project this summer to sift through Americans' personal information - including their purchasing history and church attendance - to identify new and likely wealthy donors.
- Aerospike buys AlchemyDB, launches community edition, gets new funding - Aug 27, 2012.
Aerospike (formerly Citrusleaf) added the Alchemy team to integrate AlchemyDB index, document store, graph database, and SQL functionality into Aerospike Database; launched a community edition which can handle >200K TPS on commodity servers; attracted Series B funding and added Don Haderle, the father of IBM DB2, as an advisor.
- DataWeek Top Innovator Winners in 19 categories - Aug 23, 2012.
A common trend among winners is that most of their technologies are available online and as-a-service, offering data technology to even small businesses and startups.
- US intelligence tests crowd-sourcing against experts - Aug 22, 2012.
Can large groups be better predictors of terrorism and international events than analysts in government spy services?
- SAS, LSU launch MS in Analytics - Aug 20, 2012.
In a successful pilot during the 2011-2012 school year, all graduates found employment in weeks with companies like Amazon.com, Bank of America and SAS.
- Big Data Analytics take majority of VC Funding - Aug 15, 2012.
Within Big Data, analytics companies take over 50% of recent Venture Capital deals, followed by big data infrastructure and applications.
CFP - Calls for Papers
- SocialInformatics: 2012 ASE/IEEE International Conference on Social Informatics, due Sep 15
- ECMLPKDD 2014: ECML/PKDD 2014 proposals, due Sep 17
- DBKDA 2013: Advances in Databases, Knowledge, and Data Applications, due Sep 22
- CONTEXTAWARE: Machine Learning Approaches to Mobile Context Awareness, due Sep 28
- OPT 2012: NIPS Workshop on Optimization for Machine Learning, due Sep 28
- CIDM-13: Computational Intelligence and Data Mining, due Oct 10
- ml4society: Machine Learning for Science and Society, due Nov 16
- FLAIRS-DM: FLAIRS-2013 Special Track on Data Mining, due Nov 19
- WWW2013: WWW Conference, due Nov 19
- IJCAI-13-tut: IJCAI-13 Tutorial proposals, due Dec 1
- ICML 2013: Int. Conf. on Machine Learning, due Feb 15
Quote
Big Data phenomenon is real, but it needs a better name to earn respect from researchers. Gregory Piatetsky-Shapiro