+ - 0:00:00
Notes for current slide
Notes for next slide

Data science: A game changer for science and innovation

Thiyanga S. Talagala, PhD

Department of Statistics, University of Sri Jayewardenepura

3 February 2022

1

Data science: A game changer for science and innovation

2
3
4
5
6
7

Algorithm

a set of instructions used to solve a problem

8

Algorithm

a set of instructions used to solve a problem

MEDIPI (MEDIicinal Plant Identification) algorithm

9

MEDIPI (MEDIicinal Plant Identification) algorithm

10

MEDIPI (MEDIicinal Plant Identification) algorithm

11

MEDIPI (MEDIicinal Plant Identification) algorithm

12

Can you patent an algorithm?

13
14

Largest machine learning and artificial intelligence (AI) patent owners - 2020

Data: https://www.statista.com/statistics/1062360/autonomous-driving-patent-owners-japanese-authority/

15

Facebook: Scan photos for brands and see what products you like

16
17

Better change of securing a patent

18
19
  • Model building: Given data predict the likelihood of Preeclampsia (a pregnancy complication characterized by high blood pressure and signs of damage to another organ system, most often the liver and kidneys)

Y=f(X)

  • Incorporate this into the device to generate an alert when the likelihood of having Preeclampsia is high.
20
21
22

Role of Statisticians

23

Prof. Laleen Karunanayake

24

Prof. Upul Subasinghe

25

Statistics: The Science of Data

  1. Data collection

    • Design of Experiments
  2. Data visualization

  3. Data analysis

  4. Interpretation of Results

26

Open Science

"the movement to make scientific research (including publications, data, physical samples, and software) and its dissemination accessible to all levels of society, amateur or professional"

source: https://en.wikipedia.org/wiki/Open_science

27

Reproducibility

"Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them."

source: https://www.coursera.org/learn/reproducible-research

28

Open source software authored by me

tea: R package for tea exporting countries

mozzie: R package for dengue cases in Sri Lanka

colmozzie: R package for dengue cases and climate variables in Colombo Sri Lanka

m4comp2018: R package for M4 Competition time series data

DSjobtracker: R package containing information related to data science job advertisements. What skills and qualifications are required for data science related jobs?

MedLEA: The MedLEA package provides morphological and structural features of 471 medicinal plant leaves and 1099 leaf images of 31 species and 29-45 images per species.

29

Open source software authored by me

ceylon: An R package to plot maps of Sri Lanka

covid19srilanka: An R package to get tidy format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) epidemic in Sri Lanka.

seer: R package for feature-based time series forecasting.

tsfeatures: R package tsfeatures provides methods for extracting various features from time series data.

explainer: Take a peek inside a random forest.

tsdataleaks: R Package for detecting data leakages in time series forecasting competitions.

nic: Nature inspired colour palette for data visualization.

30
31

Impact

seer package downloads

32
33

Small things matter a lot!

  • Give it a catchy name

  • Add a logo

34

Thank You!

@thiyangt

web: https://thiyanga.netlify.app

email: ttalagala@sjp.ac.lk

35

Data science: A game changer for science and innovation

2
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow