Duration: 
Eligibility: 

DATA SCIENCE

DATA SCIENCE: Data science is the study of the extraction of knowledge from data. It uses various techniques from many fields, including signal processing, mathematics, probability models, machine learning, computer programming, statistics, data engineering, pattern matching, data visualization, uncertainty modeling, data warehousing, and high-performance computing, with the goal of extracting useful knowledge from the data. Data science is not restricted to big data, although the fact that data is scaling up makes big data an important aspect of data science.

A person who does data science is called a data scientist. Data scientists solve complicated data problems using mathematics, statistics and computer science, although a high level of skill in all of these subjects is not required. A data scientist is most likely to be an expert in only one or two of these disciplines, meaning that cross-disciplinary teams can be a key component of data science.

Good data scientists are able to apply their skills to achieve a broad spectrum of end results. The skill sets and competencies that data scientists employ vary widely.

Areas Covered

  • BASE DATA ANALYTICS

Data Analytics: Data analytics is the science of analyzing data to convert information into useful knowledge. This knowledge could help us understand our world better and, in many contexts, enable us to make better decisions. While this is the broad and grand objective, the last 20 years have seen steeply decreasing costs to gather, store, and process data, creating an even stronger motivation for the use of empirical approaches to problem solving. This course seeks to present you with a wide range of data analytic techniques and is structured around the broad contours of the different types of data analytics, namely descriptive, inferential, predictive, and prescriptive analytics.
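To make the distinction between two of these types concrete, here is a minimal sketch that contrasts descriptive analytics (summarizing past data) with predictive analytics (extrapolating from it). The sales figures and the helper name fit_line are hypothetical and chosen only for illustration; real course material may use different tools.

```python
from statistics import mean, stdev

# Hypothetical monthly sales figures (illustrative data only).
months = [1, 2, 3, 4, 5, 6]
sales = [10.0, 12.0, 14.0, 16.0, 18.0, 20.0]

# Descriptive analytics: summarize what has already happened.
average_sales = mean(sales)
spread = stdev(sales)


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    numerator = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    denominator = sum((x - x_bar) ** 2 for x in xs)
    slope = numerator / denominator
    return slope, y_bar - slope * x_bar


# Predictive analytics: fit a trend line and extrapolate one month ahead.
slope, intercept = fit_line(months, sales)
forecast_month_7 = slope * 7 + intercept

print("average:", average_sales)
print("spread:", round(spread, 2))
print("forecast for month 7:", forecast_month_7)
```

Inferential and prescriptive analytics build on the same data: the former asks how confident we can be in such estimates, the latter asks what action to take given the forecast.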

Field Study

  • Introduction to Analytics and Analysis
  • Introduction to Data Analytics
  • Terminologies
  • Statistics & Machine Learning
  • Tools & Basic Prerequisites
  • Advanced Tools & Prerequisites

PYTHON

Python is an interpreted high-level programming language for general-purpose programming. Python has a design philosophy that emphasizes code readability, notably using significant whitespace. It provides constructs that enable clear programming on both small and large scales. Python features a dynamic type system and automatic memory management. It supports multiple programming paradigms, including object-oriented, imperative, functional and procedural, and has a large and comprehensive standard library.
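The short sketch below illustrates several of these features: dynamic typing, indentation-defined blocks, and the same computation written in procedural, functional and object-oriented styles. All function and class names are hypothetical and exist only for this example.

```python
# Dynamic typing: the same name can refer to values of different types.
value = 42
value = "forty-two"


# Procedural / imperative style: an explicit loop. Indentation, not
# braces, marks where each block begins and ends.
def squares_loop(numbers):
    result = []
    for n in numbers:
        result.append(n * n)
    return result


# Functional style: the same computation using map and a lambda.
def squares_functional(numbers):
    return list(map(lambda n: n * n, numbers))


# Object-oriented style: data and behaviour bundled in a class.
class Squarer:
    def __init__(self, numbers):
        self.numbers = numbers

    def squares(self):
        return [n * n for n in self.numbers]


data = [1, 2, 3]
print(squares_loop(data))
print(squares_functional(data))
print(Squarer(data).squares())
```

All three calls produce the same list; which style to use is largely a matter of readability for the problem at hand.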

Field Study

  • Introduction
  • Conditional Statements
  • Looping
  • Control Statements
  • Lists
  • Tuples
  • Dictionaries
  • Functions
  • Modules
  • Input-Output
  • Exception Handling
  • Advanced Python

R Programming

The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years.

Field Study

  • Introduction to Basics
  • Vectors
  • Matrices
  • Factors
  • Data Frames
  • Lists
  • Data Transformation Tools

ETL TOOL

ETL is short for extract, transform, load, three database functions that are combined into one tool to pull data out of one database and place it into another database.

Extract is the process of reading data from a database. In this stage, the data is collected, often from multiple and different types of sources. Transform is the process of converting the extracted data from its previous form into the form it needs to be in so that it can be placed into another database. Transformation occurs by using rules or lookup tables or by combining the data with other data. Load is the process of writing the transformed data into the target database.
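The three stages can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses the standard-library sqlite3 module with in-memory databases in place of real source and target systems, and the table names, sample rows and COUNTRY_LOOKUP table are all hypothetical.

```python
import sqlite3

# Hypothetical source database holding raw, inconsistently formatted rows.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE raw_customers (name TEXT, country_code TEXT)")
source.executemany(
    "INSERT INTO raw_customers VALUES (?, ?)",
    [("  alice ", "in"), ("BOB", "us")],
)

# Extract: read the data out of the source database.
rows = source.execute(
    "SELECT name, country_code FROM raw_customers"
).fetchall()

# Transform: apply a rule (trim and title-case names) and a lookup table
# (country code to country name).
COUNTRY_LOOKUP = {"in": "India", "us": "United States"}
cleaned = [
    (name.strip().title(), COUNTRY_LOOKUP[code.lower()])
    for name, code in rows
]

# Load: write the transformed rows into the target database.
target = sqlite3.connect(":memory:")
target.execute("CREATE TABLE customers (name TEXT, country TEXT)")
target.executemany("INSERT INTO customers VALUES (?, ?)", cleaned)
target.commit()

loaded = target.execute(
    "SELECT name, country FROM customers ORDER BY name"
).fetchall()
print(loaded)
```

A dedicated ETL tool packages these same steps behind a graphical or declarative interface and adds scheduling, logging and error handling.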

PENTAHO

Pentaho is business intelligence (BI) software that provides data integration, OLAP services, reporting, information dashboards, data mining and extract, transform, load (ETL) capabilities.

  • Database Proficiency Tools

MYSQL - User Level

MySQL is a freely available open-source Relational Database Management System (RDBMS) that uses Structured Query Language (SQL).

SQL is the most popular language for adding, accessing and managing content in a database. It is most noted for its quick processing, proven reliability, ease and flexibility of use. MySQL is an essential part of almost every open-source PHP application.
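As a small illustration of adding, accessing and managing content with SQL, the sketch below runs a few basic statements from Python. It assumes the standard-library sqlite3 module as a stand-in so that it runs without a MySQL server; the students table and its rows are hypothetical. The SQL statements shown are basic enough to carry over to MySQL, although placeholder syntax and data types differ between database drivers.

```python
import sqlite3

# In-memory SQLite database used here as a stand-in for a MySQL server.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT, course TEXT)"
)

# Adding content: INSERT new rows.
conn.executemany(
    "INSERT INTO students (name, course) VALUES (?, ?)",
    [("Asha", "Python"), ("Ravi", "R Programming")],
)

# Managing content: UPDATE an existing row.
conn.execute(
    "UPDATE students SET course = ? WHERE name = ?",
    ("Data Science", "Ravi"),
)
conn.commit()

# Accessing content: SELECT rows back out.
result = conn.execute(
    "SELECT name, course FROM students ORDER BY id"
).fetchall()
print(result)
```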

Field Study

  • Introduction to MySQL
  • Designing Databases
  • Basic SQL
  • Database Structures
  • Doing Advanced Queries
  • Advanced MySQL Concepts

ROBOTIC PROCESS AUTOMATION (RPA)

“THE AUTOMATION OF KNOWLEDGE WORK WILL BE THIS DECADE’S ENGINE OF GROWTH”

Robotic process automation (RPA) is the application of technology that allows employees in a company to configure computer software or a “robot” to capture and interpret existing applications for processing a transaction, manipulating data, triggering responses and communicating with other digital systems.

Any company that uses labor on a large scale for general knowledge process work, where people are performing high-volume, highly transactional process functions, will boost its capabilities and save money and time with robotic process automation software.

Just as industrial robots are remaking the manufacturing industry by creating higher production rates and improved quality, RPA “robots” are revolutionizing the way we think about and administer business processes, IT support processes, workflow processes, remote infrastructure and back-office work. RPA provides dramatic improvements in accuracy and cycle time and increased productivity in transaction processing, while it elevates the nature of work by removing people from dull, repetitive tasks.

Field Study

  • Introduction to RPA & Artificial Intelligence
  • Implementation Plan
  • RPA Tools
  • Methodology
  • Impacts
  • Applications
  • Practical Section

Duration: 150 Days