classes
i523
Lessons
Changes
Fork
Site
I523 Big Data Applications and Analytics (2017)
Content
Additional Content
I523 Big Data Applications and Analytics (2016)
Overview
Syllabus
Refcards
Homework
Gitlab
Python for Big Data
Python Fingerprint Example
Refcards
4.1. Linux Shell
2.2. LaTeX
Ubuntu Development Configurations
Using SSH Keys
Homework References
1.5. Contributing
I524 Big Data and Open Source Software Projects (2017)
Preface
i524 Big Data and Open Source Software Projects (2017)
I524 Calendar
I524 Lectures
I524 Assignments
HID Assignment
Technologies
B534 Cloud Computing (2018)
Content
FAQ
What are the prerequisites for this class?
I am full time student at IUPUI. Can I take the online version?
I am a residential student at IU. Can I take the online version only?
The class is full what do I do?
Do I need to buy a textbook?
Do I need a computer to participate in this class?
Where is the official IU calendar for the Fall?
How do I ask a question?
How to write a research article on computer science?
Which bibliography manager is required for the class?
Can I use endnote or other bibliography managers?
Plagiarism test and resources related to that
How many hours will this course take to work on every week?
Is all classes material final?
What are the changes to the web page?
What lectures should I learn when?
I524: Why are you doing the papers?
I524: Why are there no homework to test me on skills such as ansible or python?
I524: Why not use chef or another DevOps framework?
I am lost?
I do not like Technology/Topic/Project/etc?
I am not able to attend the online hours
Do I need to attend the online sessions?
What are the leaning outcomes?
There are so many messages on Piazza I can not keep up.
I do not know python. What do I do?
Tips: TechList.1 homework
Citations
Spelling
Github
Rubric
Timeliness
Outdated Tech ology
Techlist 1 and Paper 1 : Pagecount
Tips to Install Virtualbox
Do I generate the SSH key on Ubuntu VM ?
Ways to run Ubuntu on Windows 10
Don’t use Anaconda
Using SSH Key for Git Push
Can I write the papers on OSX?
What is the nature of team collaboration on papers
What is the nature of team collaboration on papers
What are the due dates for assignments
What are good places to find refernce entries?
How to install Matplotlib?
How to test if your OS can install cloudmesh_client
Windows
OS X
Linux
Tips to write a Good Paper
Lessons
Cloud (under construction)
Contributing
Writing Documents
Linux
Organization
Programming
Data
Software Projects
Software Projects
IaaS
Docker Container
Ansible
Chef
Compaprison of Configuration management software
Composite Clusters (under preparation)
Dynamic deployment of arbitrary X software on VC (under preparation)
Hadoop Virtual Cluster Installation
Ubuntu Juju
MongoDB Virtual Cluster (under preparation)
OpenMPI Virtual Cluster (Under Preparation)
OpenStack Heat
Other (Under Preparation)
DevOps (under preparation)
Overview Virtual Cluster (under preparation)
Puppet
Full Message of
salt-call
Execution
SaltStack
Notebooks
Todos
General
Changelog
%%version%% (unreleased)
3.2.0 (2017-08-07)
3.1.1 (2017-02-19)
3.1.0 (2017-02-10)
3.0.9 (2017-01-30)
3.0.8 (2017-01-22)
3.0.7 (2017-01-20)
3.0.6 (2017-01-11)
3.0.5 (2017-01-11)
3.0.4 (2017-01-09)
3.0.3 (2017-01-09)
3.0.2 (2017-01-07)
3.0.1 (2017-01-06)
3.0 (2017-01-06)
Page
3. Theory Track
Site
I523 Big Data Applications and Analytics (2017)
Content
Additional Content
I523 Big Data Applications and Analytics (2016)
Overview
Syllabus
Refcards
Homework
Gitlab
Python for Big Data
Python Fingerprint Example
Refcards
4.1. Linux Shell
2.2. LaTeX
Ubuntu Development Configurations
Using SSH Keys
Homework References
1.5. Contributing
I524 Big Data and Open Source Software Projects (2017)
Preface
i524 Big Data and Open Source Software Projects (2017)
I524 Calendar
I524 Lectures
I524 Assignments
HID Assignment
Technologies
B534 Cloud Computing (2018)
Content
FAQ
What are the prerequisites for this class?
I am full time student at IUPUI. Can I take the online version?
I am a residential student at IU. Can I take the online version only?
The class is full what do I do?
Do I need to buy a textbook?
Do I need a computer to participate in this class?
Where is the official IU calendar for the Fall?
How do I ask a question?
How to write a research article on computer science?
Which bibliography manager is required for the class?
Can I use endnote or other bibliography managers?
Plagiarism test and resources related to that
How many hours will this course take to work on every week?
Is all classes material final?
What are the changes to the web page?
What lectures should I learn when?
I524: Why are you doing the papers?
I524: Why are there no homework to test me on skills such as ansible or python?
I524: Why not use chef or another DevOps framework?
I am lost?
I do not like Technology/Topic/Project/etc?
I am not able to attend the online hours
Do I need to attend the online sessions?
What are the leaning outcomes?
There are so many messages on Piazza I can not keep up.
I do not know python. What do I do?
Tips: TechList.1 homework
Citations
Spelling
Github
Rubric
Timeliness
Outdated Tech ology
Techlist 1 and Paper 1 : Pagecount
Tips to Install Virtualbox
Do I generate the SSH key on Ubuntu VM ?
Ways to run Ubuntu on Windows 10
Don’t use Anaconda
Using SSH Key for Git Push
Can I write the papers on OSX?
What is the nature of team collaboration on papers
What is the nature of team collaboration on papers
What are the due dates for assignments
What are good places to find refernce entries?
How to install Matplotlib?
How to test if your OS can install cloudmesh_client
Windows
OS X
Linux
Tips to write a Good Paper
Lessons
Cloud (under construction)
Contributing
Writing Documents
Linux
Organization
Programming
Data
Software Projects
Software Projects
IaaS
Docker Container
Ansible
Chef
Compaprison of Configuration management software
Composite Clusters (under preparation)
Dynamic deployment of arbitrary X software on VC (under preparation)
Hadoop Virtual Cluster Installation
Ubuntu Juju
MongoDB Virtual Cluster (under preparation)
OpenMPI Virtual Cluster (Under Preparation)
OpenStack Heat
Other (Under Preparation)
DevOps (under preparation)
Overview Virtual Cluster (under preparation)
Puppet
Full Message of
salt-call
Execution
SaltStack
Notebooks
Todos
General
Changelog
%%version%% (unreleased)
3.2.0 (2017-08-07)
3.1.1 (2017-02-19)
3.1.0 (2017-02-10)
3.0.9 (2017-01-30)
3.0.8 (2017-01-22)
3.0.7 (2017-01-20)
3.0.6 (2017-01-11)
3.0.5 (2017-01-11)
3.0.4 (2017-01-09)
3.0.3 (2017-01-09)
3.0.2 (2017-01-07)
3.0.1 (2017-01-06)
3.0 (2017-01-06)
3. Theory Track
Web Links
i523
i524
3. Theory Track
¶
3.1. Introduction
3.1.1. Course Motivation
3.2. Overview of Data Science
3.2.1. Data Science generics and Commercial Data Deluge
3.2.2. Data Deluge and Scientific Applications and Methodology
3.2.3. Clouds and Big Data Processing; Data Science Process and Analytics
3.2.4. Clouds
3.3. Big Data Use Cases Survey
3.3.1. Overview of NIST Big Data Public Working Group (NBD-PWG) Process and Results
3.3.2. 51 Big Data Use Cases
3.3.3. Features of 51 Big Data Use Cases
3.4. Health Informatics Case Study
3.4.1. X-Informatics Case Study: Health Informatics
3.5. e-Commerce and LifeStyle Case Study
3.5.1. Recommender Systems: Introduction
3.5.2. Recommender Systems: Examples and Algorithms
3.5.3. Item-based Collaborative Filtering and its Technologies
3.6. Physics Case Study
3.6.1. Looking for Higgs Particles, Bumps in Histograms, Experiments and Accelerators (Part 1)
3.6.2. Looking for Higgs Particles: Python Event Counting for Signal and Background (Part 2)
3.6.3. Looking for Higgs Particles: Random Variables, Physics and Normal Distributions
3.6.4. Looking for Higgs Particles: Random Numbers, Distributions and Central Limit Theorem (Part 3)
3.7. Radar Case Study
3.7.1. Introduction
3.7.2. Remote Sensing
3.7.3. Ice Sheet Science
3.7.4. Global Climate Change
3.7.5. Radio Overview
3.7.6. Radio Informatics
3.8. Sensors Case Study
3.8.1. Internet of Things
3.8.2. Robotics and IOT Expectations
3.8.3. Industrial Internet of Things
3.8.4. Sensor Clouds
3.8.5. Earth/Environment/Polar Science data gathered by Sensors
3.8.6. Ubiquitous/Smart Cities
3.8.7. U-Korea (U=Ubiquitous)
3.8.8. Smart Grid
3.8.9. Resources
3.9. Sports Case Study
3.9.1. Sports Informatics I : Sabermetrics (Basic)
3.9.2. Sports Informatics II : Sabermetrics (Advanced)
3.9.3. Sports Informatics III : Other Sports
3.10. Web Search and Text Mining
3.10.1. Web Search and Text Mining I
3.10.2. Web and Document/Text Search: The Problem
3.10.3. Information Retrieval leading to Web Search
3.10.4. History behind Web Search
3.10.5. Key Fundamental Principles behind Web Search
3.10.6. Information Retrieval (Web Search) Components
3.10.7. Search Engines
3.10.8. Boolean and Vector Space Models
3.10.9. Web crawling and Document Preparation
3.10.10. Indices
3.10.11. TF-IDF and Probabilistic Models
3.10.12. Resources
3.10.13. Web Search and Text Mining II
3.10.14. Data Analytics for Web Search
3.10.15. Link Structure Analysis including PageRank
3.10.16. Web Advertising and Search
3.10.17. Clustering and Topic Models
3.10.18. Resources