CS 5950/6030 -- Cloud, IoT and Big Data Analytics -- Summer II 2018
Instructor: Dr. Ajay K. Gupta
Topics Covered
Office hours and Tutoring help - see syllabus
06/28/2018
- A brief Syllabus discussion
- Discussion on general objectives, requirements
- For discussions and lecture notes material, we will be borrowing heavily from various lectures and seminars by Zack Ives @UPenn, Eyal de Lara @UToronto, Duen H Chau @GeorgiaTech, Juliana Freire @NYU, etc.
- If you don't have a CS account, get one; see the CS sysadmins.
- Get VirtualBox / VMware installed on your laptop/desktop.
- Set up your accounts on AWS Educate and Google Cloud. AWS and GCP allow free trials, so you can sign up now. I will also email you instructions on how to get the education credits that Amazon and Google have provided free for you.
- Cloud intro lecture notes - these will change as I cover the slides in class. First a market-oriented view, then a slightly more technical view.
- ASAP, but no later than class time next Tuesday - let me know the cloud services you want covered in class, the topics / problems you want to lead / cover in class (i.e., your class presentations), and your preliminary ideas on the semester project(s). Submit a writeup of no more than 1 page to your Elearning dropbox for HW1 -- use the submission instructions
- sample presentations / project ideas from past offering
- nice comic shared by Ben from http://www.commitstrip.com/en/2015/07/08/true-story-fixing-a-self-ddos/
- read chapter 1 of the Big-Data Analytics for Cloud, IoT and Cognitive Computing book by Hwang and Chen.
- start to go over the tutorials at Getting Started With Google Cloud Platform -- https://cloud.google.com/getting-started/
- read Getting Started Guide from AWS -- http://docs.aws.amazon.com/gettingstarted/latest/awsgsg-intro/gsg-aws-intro.html
- read chapters 1 and 2 of the AWS Certified Solutions Architect book by Baron, Baz, Bixler, Gaut, Kelly, Senior and Stamper.
07/03/2018
- HW1 - class planning, your interests, project prelim - is due on Tuesday July 5, 2018
- Cloud intro2 lecture notes - these will change as I cover the slides in class.
- From each presenter, we are looking for concepts, a hands-on in-class experience of using / installing a service with example "toy" applications, at least 5 questions and answers (in an MS-Word document as well as uploaded to Elearning's quiz and discussion sections; I will show you how), a homework using your covered service / topic, presentation slides (if any) and any relevant references.
- Project proposals due 7/19/2018 on Elearning dropbox
- If time permits - Cloud types, services, case study examples, virtualization - Intro4
- cloud computing concepts contd, Lisp, MapReduce intro
7/5/2018
- Cloud computing - technical intro - data centers, scalability, challenges and opportunities
- Qwiklab Quest: GCP Essentials - first 2-3 as in-class exercises
- GCP Essentials Quest from Qwiklabs as HW2 due July 10, 2018
7/10/2018
- Intro to cloud, IoT and bigdata - from Hwang and Chen book
- Revised class presentations and lab choices due July 12, 2018 as HW3
7/12/2018
- Technology and architecture overview of cloud, bigdata and IoT -- Smart Cloud, Virtualization and Mashup Services - from Hwang and Chen book
- Qwiklab quest - baseline infrastructure - 2-3 as in-class exercises
- Baseline Infrastructure quest from qwiklabs as HW4 - due July 17, 2018
7/17/2018
- Google Cloud SQL and BigQuery by James Rhodes
- Qwiklabs corresponding to above as in-class practice
- Ingesting Data into The Cloud - Qwiklab as HW5 - due July 20, 2018
7/19/2018
- project proposals due today
- Developing and Deploying a Python Flask Web Application to GAE by Andrew Barns
- In-class practice from the slides above
- Codelab: Deploy a Python Flask Web Application to App Engine Flexible as HW6 - due July 24, 2018
7/24/18
- Quiz1
- Bigdata processing using PilotData By Aisya Nafiisyanti
- IoT - technology enablers, architectures, history - from Hwang Chen book
- tentative presentation schedule
- read chapter 3 of the Hwang+Chen book on Big Data Analytics for Cloud, IoT and Cognitive Computing
7/26/2018
- Machine Learning 101 - lecture notes
- Data Science on Google Cloud Platform by Andre Sihombing
- tentative presentation schedule - revised 7/26/18
- Qwiklab: Visualizing Data with Google Data Studio as HW7 due July 31, 2018
- read chapter 4 of the Hwang+Chen book on Big Data Analytics for Cloud, IoT and Cognitive Computing
7/31/2018
- quiz2
- Cloud Security Scanner and Containers on GCP by Sai Amel Ankem
- Add user authentication to your apps in Java, Python, etc. and Storing Image and Video Files in GCP by Koushik Mandal
- Managing and securing IoT on GCP by Zalak Patel
- read chapter 4 of the Hwang+Chen book on Big Data Analytics for Cloud, IoT and Cognitive Computing
- HW8 - Project title final, team name and team members - due next class but finish it today if you can
- HW9 - Qwiklab: A Tour of IoT Core - due Aug 4, 2018
- HW10 - Qwiklab: Running Windows Containers on Compute Engine - due Aug 4, 2018
August 2018
- BigData, ML and AI - chapters 4, 5 and 6 of the Hwang+Chen book on Big Data Analytics for Cloud, IoT and Cognitive Computing
- mapreduce and dfs contd.
- AWS EMR wordcount example instructions
- mapreduce algorithm design
I will try to cover the following as time permits.
You should, on your own, try to read Chapters 7 and 8 of the Hwang+Chen book on Big Data Analytics for Cloud, IoT and Cognitive Computing - they give a detailed overview of ML in Healthcare and Deep Reinforcement Learning for Social Media Analytics
8/2/2018
- quiz3
- Content Delivery Networks, and CDN in GCP by Rohit Chandrasekhar
- Complete Qwiklab: Cloud CDN in class
- Qwiklab Quest: Network Performance and Optimization is a good one to look at various optimization strategies
- HW11: Qwiklab: VPC Flow Logs - Analyzing Network Traffic - due Aug 7, 2018
- Digital library services related to cloud computing platforms by Wagiyah Wagiyah
- Bookshelf App tutorial - goes through whole process - familiarize yourself with all steps
- HW12: Qwiklab: Cloud Datastore - due Aug 7, 2018
- PDCS, mapreduce and dfs intro
8/7/2018
- ML, AI and TensorFlow on GCP by Corey Oldenberg
- ML Vision and Speech APIs in GCP by Swarnava Chatterjee - partial slideset, revised version will be uploaded soon
- NLP+ML in cloud by Agung Wahyudiono
- HW13 - Classify Text into Categories with the Natural Language API: Qwiklab - due August 9, 2018
- project presentation and demo schedule
8/9/2018
- quiz4
- Amazon Web Services day - have your AWS account set and ready to go
- AWS EC2, creating a VPC with public and private subnets, LAMP / RDS by Rohit Bandooni
- AWS Elastic Block Store (EBS) by Jayasree Nayana
- AWS Database, DynamoDB, Lambda, Elastic Container Service by Vadivu Sampath
8/14/2018
- Project presentations - 15 minutes for each presentation.
- project presentation and demo schedule
- Project submission by 11:30pm on 8/16/18 -- two files to be submitted through your CS6030 elearning -- first file: project report (equivalent of a technical paper using standard IEEE/ACM style) in pdf; second file: a zip file of your slides, source code, scripts, readme file, screenshots, and any other supporting files/docs needed to run your project.
- project report samples - sample1, sample2, sample3
- Note: modify your report to fit your project needs. The main components of the report are (1) Title, (2) Abstract, (3) Introduction / project statement / project goals (motivates and describes your project problem and its goals), (4) intermediate sections that depend on the project - design considerations, sw/hw architectures, schematics, sample screenshots of main functionalities / UIs, technologies used, etc. go here, (5) Challenges faced, (6) Conclusions and Future Work, (7) References, and (8) Appendices - source code listings, class diagrams, screenshots of GUIs, etc.
- Project presentation, project reviews and project submission due 08/16/18 by 11:30pm
8/16/2018
- 3:30-6:00pm in C0123, CEAS -- exam in class on eLearning/paper. Closed notes, closed book, no internet access other than to your exam on eLearning, no texting, no chatting. Only blank scrap paper allowed.
- don't forget to write your name on the sign-in sheet before you leave
- comprehensive exam -- all topics covered throughout the semester included
- Project presentations contd.
Good Luck and have a wonderful rest of the summer!
General Instructions on submitting your homeworks.
- For programming assignments, submit a SINGLE zipped file of your source code, scripts (to run your program, if any) and a brief report, along with a copy of a couple of sample executions of your solution, to the class's Elearning. Needless to say, you should use good conventions and programming practices in developing your programs [in case you forgot, refresh them from the coding-convention links provided on the TopicsCovered page or dig them up on your own!]
- Use <hw#cs6030_yourlastname_mmddyy.{zip,ppt,doc,tex}> as the naming convention for your zipped, pdf, ppt, MS-Word, or LaTeX files when submitting on Elearning. Replace '#' with the appropriate homework number. Also replace cs6030 with cs5950 if you are enrolled in that section.
- There will be significant point penalties for not following the naming convention above, good programming practices or submitting a different format of archive file or if your program does not run. Make sure it is a .zip and NOT another format (no .rar, .tar, .tar.gz, etc)
Any student may be asked to show and discuss his or her solution in class, so be ready with your presentation.
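As an illustration of the naming convention in the submission instructions above, here is a minimal Python sketch that builds a submission file name. The function name `submission_name` and its parameters are hypothetical and not part of any course tool. It assumes the last name is used exactly as given, and it accepts only the extensions listed in the pattern (pdf is mentioned in the text but not in the pattern, so it is left out here).

```python
from datetime import date

# Extensions and course codes taken from the naming-convention text.
ALLOWED_EXTENSIONS = {"zip", "ppt", "doc", "tex"}
COURSES = {"cs6030", "cs5950"}


def submission_name(hw_number, last_name, submitted, course="cs6030", ext="zip"):
    """Build a name of the form hw#cs6030_yourlastname_mmddyy.ext.

    Hypothetical helper: the course page defines only the pattern,
    not this function.
    """
    if course not in COURSES:
        raise ValueError(f"unknown course section: {course}")
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"extension not allowed: {ext}")
    # %m%d%y renders the date as mmddyy, e.g. July 5, 2018 -> 070518.
    return f"hw{hw_number}{course}_{last_name}_{submitted:%m%d%y}.{ext}"


if __name__ == "__main__":
    print(submission_name(1, "Smith", date(2018, 7, 5)))
```

For example, `submission_name(1, "Smith", date(2018, 7, 5))` returns `hw1cs6030_Smith_070518.zip`, while passing `ext="rar"` raises a `ValueError`.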
Programming Practices, useful and related links:
- a brief from java.sun.com
- CodeConventions from Oracle - an extensive 20 page doc - available at http://www.oracle.com/technetwork/java/codeconv-138413.html
- 10 Most Useful Java Best Practice Quotes for Java Developers
- Collected Java Practices from javapractices.com (warning - it's an extensive collection).
- Java tutorials at http://docs.oracle.com/javase/tutorial/
- Many links at http://www.ontko.com/java/java_coding_standards.html, local link.
- from quora - What is an intuitive explanation about how threads work?
- more general link from quora on parallel computing
- interesting read - HiPC newsletter February 2015