• No results found

MIS Big Data Information Systems

N/A
N/A
Protected

Academic year: 2021

Share "MIS Big Data Information Systems"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

MIS 484-4 Big Data Information Systems Chetan (Chet) Kumar, PhD

Associate Professor of Information Systems California State University San Marcos ckumar@csusm.edu

COURSE DESCRIPTION

The aim of this course is to examine insights and uses of Cloud computing and Big Data. It focuses on the applications of data models in the Cloud computing and Big Data environments. This course will describe Big Data, unstructured data types, and their uses. Discussions will include impact of factors such as Variety, Velocity, and Volume of Big Data. It will describe Cloud computing, its uses, benefits, and challenges. The course will examine the emerging data models for these areas. This includes the relevance of data supersets and importance of correlation vs causality. A number of Big Data tools such as Hadoop and NoSQL databases will be illustrated in this course. The course will also examine Big Data applications in different areas such as Retail, Health, Finance, Supply Chain / Logistics, and Marketing. Students will actively participate in this course through lab projects, case, and project presentations.

COURSE OBJECTIVES Students will learn to:

1. Describe Cloud computing and its impact on decision support, analytics of cloud, costs, and security

2. Discuss the origins of Big Data, Hadoop, and Map / Reduce

3. Describe the impact of V3 = Variety, Velocity, Volume and unstructured data on Big Data

4. Recognize the importance of n=All data superset availability

5. Examine the data models for Big Data including evaluating correlation vs causality 6. Apply the software technologies of Big Data including Hadoop and NoSQL databases 7. Analyze Big Data and Cloud computing applications in different areas such as Retail, Health, Finance, Supply Chain / Logistics, and Marketing

8. Evaluate the emerging trends of Big Data and Cloud computing

RECOMMENDED BOOKS AND REFERENCES

Big Data: A Revolution That Will Transform How We Live, Work, and Think, Viktor

(2)

Statistical and Machine-Learning Data Mining: Techniques for Better Predictive

Modeling and Analysis of Big Data, Second Edition, Bruce Ratner, 2011

Hadoop Beginner's Guide, Garry Turkington, 2013

COURSE STRUCTURE

Lecture Materials: In addition to the textbook, materials used in this class include lecture videos, slides and notes, which will be made available via course website. Students should prepare this material before class and may be asked to discuss this in class. The answers will count towards class participation grade.

Lab Exercises: All lab exercise will be based on discovery learning, by which students, individually or as a team, with the instructor’s help, will solve assigned problems. Exams: Two non-cumulative exams will be in-class, closed notes. Make-up exam will be given only in exceptional circumstances when the student has a genuine medical or personal emergency for which they can provide official documentation.

Project: Every student must implement a class project. All project topics must be discussed with and approved by the instructor. Students will be assigned to project topics and Big Datasets in collaboration with instructor, and students are expected to coordinate and work effectively through the semester. All projects must have an in-class presentation which includes demonstrating the project Big Data implementation, done with PowerPoint and Big Data Toolsets. Each project has the following requirements:

Written Paper: A formal written paper describing your project (10 pages double spaced) must be submitted.

In Class Presentation: Each individual must present their work to the class

(approximately 10-15 minutes) with a PowerPoint and Big Data Project presentation. The project presentation may be done around a Big Data dataset topic theme to facilitate presentation.

Homework: Students will periodically be assigned homework exercises. The homework assignment solutions have to be printed out / written as a hard copy and submitted in class on time unless otherwise stated. 5 minutes late is same as one day late. Late submission will be deducted 20% from the original score each day.

Participation: You are expected to attend all scheduled classes and participate in discussions. Attendance may be taken daily and will influence your participation grade.

GRADING POLICY Points:

2 Exams 60% (30% each)

Approximate score to grade conversion: A 95-100%

(3)

Project 30% Homework 7.5% Participation 2.5%

A- 90-94% B+ 85-89% B 80-84% B- 75-79% C+ 70-74% C 65-69% D 60-64% F Less than 60%

COURSE POLICIES

Readings:Assigned readings, lecture videos, and lecture slides, should be prepared before class. They are meant to be supplementary to the class session, and very helpful for class discussion and activities.

Assignments/Project Submission: All assignments/projects should be submitted in class on the assigned submission date and time before class session commences. Five minutes late is same as one day late. Late submission will result in a reduction of the grade by 20% for each calendar day for that late assignment.

Academic Integrity: Student will be expected to adhere to standards of academic honesty and integrity, as outlined in the Student Academic Honesty Policy. All written work and oral presentation assignments must be original work. All ideas/materials that are borrowed from other sources must have appropriate references to the original sources. Any quoted material should give credit to the source and be punctuated with quotation marks. Students are responsible for honest completion of their work including examinations. There will be no tolerance for infractions. If you believe there has been an infraction by someone in the class, please bring it to the instructor’s attention. The instructor reserves the right to discipline any student for academic dishonesty, in accordance with the general rules and regulations of the university. Disciplinary action may include the lowering of grades and/or the assignment or a failing grade for an exam, assignment, or the class as a whole.

Class Participation and Behavior: We are all adults, so attendance is your responsibility. Note that Class Participation forms an important component of the final grade. Students can only score well in that if they regularly attend and actively participate in class discussions. Being prepared for class by reviewing the assigned course material before class is an important part of class participation. Therefore students may be quizzed by the instructor for class preparation. Thorough answers by the students will be a good plus point for the class participation grade. Each student brings experience that others in the class can benefit from and thus individual attendance can enhance the learning experience of the entire class (including the instructor). Students are expected to respect the rights of their classmates by exhibiting behavior that is conducive (or not disruptive) to the

(4)

learning environment of the classroom. Cell phones should be turned off or put on silent ring or vibrate. Phone calls should not be made or taken during the class period. If there is an emergency situation that prevents you from adhering to this policy, please see your instructor to discuss it as soon as possible. Students are expected to refrain from using computers during class time for activities that are NOT related to the class topic being discussed that day. These activities include, but are not limited to, playing videogames, instant messaging, social networking, email, etc. If you miss a class, it is your responsibilityto obtain notes from a fellow student.  University Writing Requirement: CSUSM has established a 2500-word writing

requirement (10 pages) for each course. This requirement will be met primarily via the final project, homework assignments, and essay questions in examinations. All reports should be free of grammatical and spelling errors, typed, and as concise as possible.

Students with Disabilities. It is this institution’s policy to not discriminate against qualified students with documented disabilities. Persons with disabilities are welcome to attend all classes, programs, and events. If you need accommodations, or have questions about access to buildings where learning activities are held, please contact the Office of Disabled Student Services (DSS) in Suite 5205 in Craven Hall, or via phone at (760) 750-4905; (760) 750-4909 (TDD), or via email at dss@csusm.edu. If you need assistance during a class, program, or event, please contact any member of the DSS staff. If you have a disability-related need for modifying your exam or test environment, notify the instructor during the first week of classes so that your needs can be accommodated in time. You will be asked to present documentation from DSS that describes the nature of your disability and the recommended remedy.  Confidentiality: In order for us to have a free and open learning environment, each

student is expected to respect the confidentiality of any information or material shared in class discussion.

E-mail: All students are required to obtain a CSUSM email account. If you have any questions about the course or need assistance, please email me through the above mentioned email at any time.

Special Note: Keep your graded work until you receive your official grade, for there might be recording errors by the instructor.

Big Data Course Schedule

(tentative, may be updated)

Date Topic Tasks

(refer Cougar Courses for updated deliverables due dates)

Week 1 Course Overview

Big Data and Cloud computing introduction

Hwk-Ch1

(5)

Week 2 Ch 1 Cloud computing

Ch 2 Decision support/Analytics of cloud Hwk-Ch2

Week 3 Ch 3 Big Data: Origin of Hadoop Hwk-Ch2

Due: Hwk-Ch1 Ch 3 Big Data: Origin of Map / Reduce Hwk-Ch3

Week 4 Ch 4 V3 = Variety, Velocity, Volume Hwk-Ch3

Ch 5 n=All data superset Hwk-Ch4

Due: Hwk-Ch2, Project Topic

Week 5 Ch 6 Big Data Model: Impact to causality Hwk-Ch4

Ch 6 Big Data Model: Impact to causality Hwk-Ch6

Due: Hwk-Ch3, Project Outline

Week 6 Ch 6 Big Data Model: Impact to causality Hwk-Ch6

Ch 6 Big Data Model: Sentiment analysis Hwk-Ch6

Week 7 Project Consultation Due: Hwk-Ch4

Review Session for Exam 1 Due: Hwk-Ch5

Week 8 Exam 1 Due: Hwk-Ch6

Ch 7 Big Data Technologies 1 Hwk-Ch7

Due: Project Big Data Collection

Week 9 Ch 7 Big Data Technologies 2 Hwk-Ch7

Ch 7 Big Data Technologies 3 Hwk-Ch8

Week 10 Ch 8 Types of unstructured data Hwk-Ch8

Due: Hwk-Ch7 Ch 9 Big Data Scalability Hwk-Ch9

Week 11 Ch 10 Application of Big Data: Healthcare Hwk-Ch9

Ch 10 Application of Big Data: Finance Hwk-Ch9

Week 12 Ch 10 Application of Big Data: Supply Chain /

Logistics

Hwk-Ch10 Ch 10 Application of Big Data: Marketing Hwk-Ch11

Due: Hwk-Ch8, Project Big Data Analysis

Week 13 Ch 11 Big Data toolset: NoSQL databases,

Hadoop

Hwk-Ch14 Ch 11 Big Data toolset: Map / Reduce, HDFS

Yarn

Due: Hwk-Ch9

Week 14 Ch 11 Big Data toolset: Hive / Pig / Scoop Due: Hwk-Ch10

Project Consultation

Week 15 Review Session for Exam 2

Project Presentations

Due: Hwk-Ch11 Project Presentations

Week 16 Exam 2

References

Related documents

c) for a claim against a health service provider – a copy of any consent given to the health service provider by the injured person about the treatment claimed to have given rise to

Ethnicity was also associated with diabetes; after adjustment, South and East Asian women had three times higher odds of diabetes in pregnancy compared to White women.. We found

Remove Marcs domain user account from the Windows SBS Remote Web Workplace Users security group by using Active Directory Users and Computers.. Remove Marcs domain user account

All students MUST register within the first two (2) weeks of each semester. A registered student who has been permitted to withdraw from studies shall be refunded the tuition fees

The problems associated with contracts of adhesion (uneven bargaining power, oppressive terms, and widely recognized lack of readability), the lack of effective ex

Even more detrimental to your spiritual (true) life, you began to think all the emanations from your self created thought, energy, emotion, structures were originating from you..

DuCaP (a CRPC cell line that has high AKR1C3 expression levels) used and converted DHEA under hormone-depleted conditions into T and DHT. The DHEA-induced growth of DuCaP could

If the Race Committee signals a shortened course at the end of the first lap, boats shall round Mark 2p to port then sail through the finishing line. Boats failing to finish by