<?xml version="1.0" encoding="UTF-8"?>
<itemContainer xmlns="http://omeka.org/schemas/omeka-xml/v5" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://omeka.org/schemas/omeka-xml/v5 http://omeka.org/schemas/omeka-xml/v5/omeka-xml-5-0.xsd" uri="https://omeka.ibu.edu.ba/items/browse?output=omeka-xml&amp;page=16" accessDate="2026-06-11T16:02:59+01:00">
  <miscellaneousContainer>
    <pagination>
      <pageNumber>16</pageNumber>
      <perPage>10</perPage>
      <totalResults>3494</totalResults>
    </pagination>
  </miscellaneousContainer>
  <item itemId="3456" public="1" featured="0">
    <fileContainer>
      <file fileId="4268">
        <src>https://omeka.ibu.edu.ba/files/original/6b2e464890bd658d55b645c2200671f3.pdf</src>
        <authentication>368e8f19562326e2eedc5b353024065f</authentication>
        <elementSetContainer>
          <elementSet elementSetId="4">
            <name>PDF Text</name>
            <description/>
            <elementContainer>
              <element elementId="52">
                <name>Text</name>
                <description/>
                <elementTextContainer>
                  <elementText elementTextId="26402">
                    <text>Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020215

Machine Learning in Autism Spectrum Disorder Diagnosis

Naida Nalo1, Jasmin Kevrić1
1

International Burch University, Sarajevo, Bosnia and Herzegovina
naida.nalo@stu.ibu.edu.ba
jasmin.kevric@ibu.edu.ba

Abstract— This paper represents an overview of Machine Learning techniques used in Autism
Spectrum Disorder - ASD diagnosis. ASD is detected based on behavioral screening which is time
consuming and can only be taken by a medical professional. The idea is to find a smaller number of
features that are still able to equally well provide satisfying results and not lose the accuracy, sensitivity
nor specificity. Some of the algorithms mostly used in recent studies were Artificial Neural Network ANN and Alternating Decision Trees - ADTrees. The researches usually use WEKA software package
for applying the algorithm and obtaining results.
Keywords—Machine Learning, Autism Spectrum Disorder, diagnosis, features, ANN, ADTree, WEKA.

1. Introduction

Autism Spectrum Disorder (ASD) is defined as a developmental disorder that reflects in difficulty to
communicate and interact with people, to have minimal interests and to generate pattern like behaviors. There
are three separate conditions that are combined into the Autism Spectrum Disorder and those are: Autistic
disorder, Asperger’s syndrome and Pervasive developmental disorder not otherwise specified (PDD-NOS).
Even though ASD can be a lifelong disorder, an early diagnosis and proper treatment can help improve
communications skills and overall ability to function. However, the diagnosis sometimes takes a lot of time,
which prolongs the appropriate treatment [1].

Two widely used clinical diagnosis tools for diagnosing autism are The Autism Diagnostic Interview-Revised
(ADI-R) [2] and Autism Diagnostic Observation Schedule (ADOS) [3]. ADI-R consists of ninety-three
questions that are to be answered by a clinician. This process can take up to two and a half hours to conduct.
The ADOS contains four modules, that can be used to test children and adults, according to behavioral and
language levels of the person to be tested. This tool uses an algorithm that results with a diagnosis based on
the scores of responses. Each module has its own scores [4]. It has become of great importance to find a faster
but reliable method of diagnosing ASD, since the earliest treatment gives a greater chance for improvement.

�This paper gives insight on the studies conducted in the past on the subject of Machine Learning in ASD
diagnosis. A brief review on differences between two versions of Diagnostics and Statistical Manual of
Mental Disorders (DSM-IV [5] and DSM-5 [6]) is made.

2. Literature review

The ADI-R is one of two most widely used instruments for behavioral diagnosis of ASD [7]. It is structured
in a form of an exam containing ninety-three questions and can be applied to individuals from the age of
eighteen months and above. The questions are answered by a trained professional but still take up to two and
a half hours to finish. And additionally, the gap between the initial screening and the resulting diagnosis can
be around thirteen months, depending on the socioeconomic status of the family [8]. This introduces an
additional delay in the early treatment crucial for proper development of the person, especially children. In
[9], it was proposed to create an exam that can be conducted in minutes, rather than hours and receive
satisfying results.

Machine learning was used to select the right amount of questions, out of initial ninety-three, that would be
able to classify the person in either autism or non-spectrum class. In total, fifteen algorithms were tested, and
the one that performed the best with the given data was found to be the Alternating Decision Tree (ADTree).
This classification algorithm managed to successfully classify all individuals diagnosed with ASD using only
eight questions, that were previously tested with a complete set of ninety-three questions of ADI-R and
misclassified only one.

However, this research was proved to be unreliable by [10]. This paper brought to attention the importance of
understanding both the computational and clinical area before giving any conclusions. The research conducted
in [9], was limited by the imbalance of data as well as excluding a big part of it due to missing values. This
resulted in only a two-class diagnosis, when originally it should have been three. The middle class, which is
the most difficult to identify, that was removed was the ASD Spectrum, leaving only the ASD and Nonspectrum cases. A recommendation from [10] is to use the Unweighted Average Recall – UAR, which is a
measure of performance that works better for such unbalanced data and that was used in this paper, when they
tried to replicate the work done in [9]. Their results were algorithm dependent and if another algorithm was to
be applied to the same data, the number of features would vary.

Another important issue that was discussed in [4] were the differences between results of DSM-IV and the
new DSM-5. From [4] we learn that although both of the screening methods abovementioned have shown
good sensitivity, specificity and high reliability in experiments, the majority of those studies were based on
the DSM-IV rather than the new criteria for diagnosing ASD, the DSM-5. Several studies that were mentioned
in [4] had conflicting results when using the two different versions of the manual. This introduces the need to
reevaluate the current tools for diagnosis, and to adjust them to the new criteria of diagnosing ASD.

�Combining all the research and experiments from the past, and critically analyzing the results, suggestions and
advices for the future projects are provided in [11]. The author highlights that none of the screening tools
currently in use, have incorporated the machine learning algorithm for diagnosing ASD from the recent
studies. Along with the problem of unbalanced data, the overlap of ASD, ADHD and Asperger Syndrome as
well as different forms of ASD, represents another obstacle in diagnosing, mentioned in [11]. Most studies
simplified the classification process by removing these classes and leaving just the ‘Severe Autism’ and ‘Nonspectrum’ as a possible outcome. This of course leads to unreliable classifiers with questionable sensitivity,
specificity and accuracy.

3. Problem formulation

For the purpose of this project, in total three datasets have been downloaded from UCI Machine Learning
Repository [12]. They deal with data related to ASD screening of three different sets of population: toddlers,
adolescents and adults. The data was collected through an application in a form of a quiz [13]. The data sets
consist of twenty questions, out of which ten are behavioral features, while the other ten are individual
characteristics. The application offers four modules representing the age category for individuals from the age:
12-36 months, 4-11 years, 12-16 years and 17 years and older. Ten questions, that differed depending on the
age of the individual, from the application are provided in Table 1, in Appendix 1. Description of the data set
is provided in Table 2.

Table 2. Data set characteristics
ASD Screening Data
MODULE

CHILDREN

ADOLESCENTS

ADULTS

Number of instances

292

104

704

Number of attributes

21

Missing values

Yes

The first module is based on current parent-assisted ASD screening tool, the Quantitative Checklist for Autism
Toddlers (Q-CHAT), while the remaining three are based on appropriate versions of Autism Spectrum
Quotient (AQ), which are considered to be good candidates of diagnosing and were somewhat referred to as
a ‘red flag’. These screening tools were discussed in [4], where it has been noted that the ten questions can
only be used for acknowledging if additional clinical testing is required and is not a definite diagnosis. An
analysis that studied these tools is [14]. Next step of this paper will be to investigate the dataset and determine
the best way to make the most of it.

�4. Machine learning methods

Seven algorithms that were chosen for the process of attribute selection with their brief description are
provided in this chapter. Bayes Net’s function is to learn the Bayesian networks. This algorithm assumes
nominal attributes and no missing values. Search process is done using K2 or TAN algorithm. More
sophisticated search methods, used for search, are built on genetic algorithm, hill-climbing, simulated
annealing, etc. Search speed can additionally be enhanced by ADTrees [15].

Simple Logistic is an algorithm that builds regression models and fits them with use of LogitBoost and simple
regression functions as base learners. Number of iterations are calculated using cross-validation, supporting
attribute selection [15].
Decision Stump’s function is to build one-level decision trees for sets with a categorical or numeric value.
Missing values in this algorithm are dealt with by seeing them as a separate value and creating a third branch
from the stump [15].

J48 is an algorithm that creates a pruned or unpruned C4.5 decision tree. C4.5 This algorithm produces a
classifier in a form of a decision tree, which can be either a leaf or a decision node. A leaf indicates a class, a
decision node specifies a test with one branch and a subtree for every possible outcome of the test [16].

Logistic Model Tree, or LMT, combines two most popular methods of classification: linear logistic regression
and tree induction. This algorithm results in not only classification but also in explicit probability estimates of
the class. Another advantage of LMT is that it results in a single tree which makes it easier to interpret [17].

Random Forest is an algorithm that combines tree predictors. Each tree is dependent on values of a random
vector, which is sampled independently and with same distribution for all trees of the forest. Generalization
error of this algorithm depends on strength of individual trees of thee forest and their correlation. As the
number of trees grows, the generalizatioon error converges to the limit [18].

REPTree algorithm represents a fast decision tree learner. This algorithm uses information regarding gain or
variance and prunes it with reduced-error pruning to build a ecision or regression tree. Values for numeric
attributes are only sorted once, which optimized its speed [15].

5. Results

Classification procedure of this research paper was split in two main parts. First part was applying a 10-fold
cross-validation to all attributes and all three datasets. Cross-validation of n-folds splits the original dataset
into n parts where n-1 parts are used as a train test, while the nth part is used as a test set [19]. Another method
used in this part was applying a percentage split of three different values: 50%, 70% and 90%. Percentage split
separates the original dataset into train and test according to the chosen percentage.

�Both methods were tested using sixteen algorithms, giving in total 64 results for each dataset. The second
part of classification involved attribute selection. Algorithms chosen for this part resulted from the first part,
since only those that gave 100% accuracy for all three datasets were again used in attribute selection part. A
more detailed 10-fold cross-validation results and algorithm performance are presented in Table 3, and the
results of percentage-split and algorithm performance is shown in Table 4.

Table 3. 10-fold cross-validation accuracy results
Method

Cross-validation 10

Algorithm

Dataset

Bayes

Child

Adolescent

Adult

BayesNet

100%

100%

100%

NaiveBayes

98.9726%

98.0769%

97.017%

MultinomialText

51.7123%

60.5769%

73.1534%

BayesUpdateable

98.9726%

98.0769%

97.017%

Logistic

95.2055%

95.1923%

97.017%

MultilayerPerceptron

99,6575%

89.4231%

100%

SimpleLogistic

100%

100%

100%

SMO

100%

89.4231%

100%

88.3562%

90.3846%

94.8864%

DecisionStump

100%

100%

100%

HoeffdingTree

100%

99.0385%

99.858%

J48

100%

100%

100%

LMT

100%

100%

100%

RandomForest

100%

100%

100%

RandomTree

93.1507%

80.7692%

96.1648%

REPTree

100%

100%

100%

Functions

Lazy
IBk
Trees

�Table 4. Percentage split accuracy
Method

Percentage Split (50%-50%, 70%-30%, 90%-10%)

Algorithm

Dataset

Bayes

Child

Adolescent

BayesNet

100%

100%

100%

NaiveBayes

98.6%

96.5%

96.5%

MultinomialText

51.36
%

55.68
%

41.37
%

BayesUpdateable

98.6%

96.5%

96.5%

Logistic

93.1%

89.7%

93.1%

MultilayerPerceptr
on

97.2%

97.7%

100%

SimpleLogistic

100%

100%

SMO

96.5%

100%
98.07
%
61.53
%
98.07
%

100%
100%

Adult
100
%
100
%

54.83
%

50%

100%

100
%

100%
98.01
%
74.14
%
98.01
%

100%

100%

98.5%

97%

74.88
%
98.57
%

97.14
%

80%

Functions
84.61
%
94.23
%

87.09
%
93.54
%

90%

96.59
%

94.78
%

95.71
%

80%

100%

100%

100%

100%

100%

100%

100
%

100%

100%

100%

95.4%

100%

92.3%

93.54
%

80%

100%

100%

100%

89.04
%

89.7%

86.2%

88.46
%

90.32
%

100
%

95.73
%

94.31
%

94.28
%

DecisionStump

100%

100%

100%

100%

100%

100%

100%

100%

HoeffdingTree

98.6%

100%

100%

98.07
%

100%

100%

100%

100%

J48

100%

100%

100%

100%

100%

100%

100%

100%

LMT

100%

100%

100%

100%

100%

100%

100%

100%

RandomForest

100%

100%

100%

100%

100%

100%

100%

100%

RandomTree

93.8%

94.3%

82.7%

67.3%

74.19
%

100%

90.99
%

100%

REPTree

100%

100%

100%

100%

100%

100%

100%

100%

Lazy
IBk
Trees
100
%
100
%
100
%
100
%
100
%
100
%
100
%

Algorithms used for the second part of classifying process of this research paper were chosen according to the
percentage of accuracy of Table 3. Out of four Bayes algorithms, only BayesNet gave 100%, SimpleLogistic
is the only one out of four Function algorithms that proved the best, and lastly, Tree algorithms shown good
results with DecisionStump, J48, LMT, RandomForest and REPTree performing in 100% accuracy for all
three datasets. These seven algorithms were used in attribute selection part of classification. All three datasets
originally had 21 attributes, and the previous two methods mentioned above included all attributes in the
process. Attribute selection method [20] is a process of selecting the most relevant attributes and by doing so,
reducing the processing time.

�In total, five attribute evaluators have been used in attribute selection process. Those were:
ClassifierAttributeEval,

CorrelationAttributeEval,

ReliefAttributeEval,

CfsSubsetEval

and

WrapperSubsetEval. In a combination of these evaluators, three search methods were used: BestFirst,
GreedyStepwise and Ranker [15]. ClassifierAttributeEval evaluates the worth of an attribute with use of a
user-specified classifier [21]. CorrelationAttributeEval evaluates the worth of an attribute by measuring the
correlation between the attribute and the class [21]. ReliefFAttributeEval sampling of instances happens
randomly, and the neighboring instances of the same or different class is checked on [15]. CfsSubsetEval
evaluates the worth of a subset of attributes by considering the individual predictive ability of each attribute
along with the degree of redundancy between them. Missing values can be seen as a separate value or, with
proportion to their frequency, its counts can be distributed among other values [15]. WrapperSubsetEval
evaluates attribute sets by using a learning scheme. Cross-validation estimates the accuracy of the learning
scheme for a set of attributes [15] .

ClassifierAttributeEval, CorrelationAttEval and ReliefAttributeEval required Ranker as a search method. In
all three datasets, number of attributes chosen for the Ranker was five. CfsSubsetEval and WrapperSubsetEval
work using either BestFirst or GreedySetpwise search method. Combining the evaluators with search methods,
we obtained 56 results for each dataset. After the attribute selection was performed on the complete set of 21
attributes, all evaluators resulted with 100% accuracy, regardless of the algorithm used. The attributes of all
three datasets are presented in Table 5.

Table 5. Attributes by number with description
Attribute

Description

1 - 10

Score of 10 questions

11

Age (number)

12

Gender (male or female)

13

Ethnicity (list provided)

14

Born with jaundice

15

Autism in family

16

Country of residence

17

Used app before

18

Result of app (automated calculation)

19

Age description (toddler, child, adolescent, adult)

20

Relation (who is completing the test)

21

Class ASD/NoASD

Attribute that was present in all three datasets and that showed extremely high correlation was the 18th
attribute. This attribute represents the score of ten questions of the application [13]. Therefore, a new approach
was used. The 18th attribute was removed completely, and the process of selection was repeated for all three
datasets. Results of selection are shown in Table 6, 7 and 8, in Appendix 2. Results of accuracy are shown in
Table 9, 10 and 11, in Appendix 3.

�The lowest performance for child, adolescent and adult dataset was achieved by DecisionStump, resulting in
78.082%, 70.192% and 82.822% respectively, as can be observed from the results. The lowest number of
attributes selected is 1, and the highest is 14. However, the best results required less than that. The algorithms
that showed best performance for child dataset were SimpleLogistic and LMT. These algorithms, with applied
CfsSubsetEvaluator, resulted in accuracy of 98.973%, and used 10 attributes. BayesNet showed best results,
with applied ClassifierAttributeEvaluator, it performed in 90.385% accuracy for adolescent dataset and used
only 5 attributes. Simple Logistic successfully classified the adult dataset, with impressive accuracy of
99.432% and used 11 attributes in the process.

6. Conclusion

The conclusion is split into two parts, one regarding actions taken to review already written papers and discuss
their results, and second which deals with actions taken to derive our own conclusion through processing
datasets. This research paper involved three datasets: child, adolescent and adult, with each having 21
attributes. Original datasets were processed using two methods for splitting the dataset into train and test and
used sixteen algorithms for both. The obtained results from the first test helped choose algorithms for the
second part of testing which involved attribute selection. According to the results, seven algorithms stood out.
Attribute selection was performed on all three datasets using seven evaluators. All results had 100% accuracy,
despite using different number of attributes. This leaded to another approach which included removing the
18th attribute and reapplying the selection process. Number of attributes for best performances were dependent
on the dataset and therefore are different. One should keep in mind that classification process using five
attributes can only be used as an indicator of whether further medical testing should be conducted.

The main lesson learned, reading papers written so far on this topic, is that integrating ML in ASD diagnosis
and its screening tools is a much harder process than it seems. Finding the appropriate number of features and
managing to reduce the time of diagnosis depends on many parameters. Many experiments, in an attempt to
reduce the time required for the diagnosis process, have discarded some important issues for the sake of
simplicity. Their initially admiring results could not be taken for granted, due to misbalanced data and
questionable reliability. The algorithm should not be dependent on data. The issue of distinguishing between
ASD and PDD related disorders (ADHD, Asperger Syndrome) represents a big obstacle for proper diagnosis
of ASD. The algorithm should be provided with a similar number of all possible outcomes in order to learn to
better distinguish between categories. The need to reevaluate the current diagnosis tools and adjust them to fit
the new criteria from DSM-5 arises.

�APPENDIX 1
Table 1. Questions from the ASDQuiz application [13]
13-36 months TODDLER

4-11 years – CHILD

12-16 years –
ADOLESCENT

17 &amp; older –
ADULT

1.

Does your child look
at you when you call
his/her name?

He/she often notices
small sounds when
other do not?

He/she notices patterns
in things all the time?

I often notice small
sounds when others
do not?

2.

How easy is it for
you to get eye
contact with your
child?

3.

4.

5.

6.

7.

Does your child
point to indicate that
he/she wants
something (e.g. toy
out of reach)?
Does your child
point to share
interest with you?
(e.g. pointing at an
interesting sight)
Does your child
pretend? (e.g. care
for dolls, talk on a
toy phone)
Does your child
follow where you are
looking?
If you or someone
else in the family is
visibly upset, does
your child show
signs of wanting to
comfort you/them?
(e.g. gives a hug)

He/she usually
concentrates more on
the whole picture
rather than the small
details?
In a social group,
he/she can easily
keep track of several
different people’s
conversation?
He/she finds it easy
to go back and forth
between different
activities?

He/she usually
concentrates more on
the whole picture
rather than the small
details?
In a social group,
he/she can easily keep
track of several
different people’s
conversation?
If there is an
interruption, he/she can
switch back to what
he/she was doing very
quickly?

I usually concentrate
more on the whole
picture, rather than
the small details?
I find it easy to do
more than one thing
at once?
If there is an
interruption, I can
switch back to what
I was doing easily?

He/she doesn’t know
how to keep a
conversation going
with his/her peers?

He/she frequently finds
that he/she doesn’t
know how to keep a
conversation going?

I find it easy to read
between the lines
when someone is
talking to me?

He/she is good at
social chit-chat?

He/she is good at
social chit-chat?

I know how to tell if
someone listening to
me is getting bored?

When he/she reads a
story, he/she finds it
hard to work out the
character’s intentions
or feelings?

When he/she was
younger, he/she used
to enjoy playing games
involving pretending
with other children?

When I’m reading
the story, I find it
difficult to work out
the character’s
intentions?

Would you describe
your child’s first
words as: typical,
unusual, the child
doesn’t speak?

When he/she was on
preschool, he/she
used to enjoy playing
games involving
pretending with other
children?

She/he finds it difficult
to imagine what it
would be like to be
someone else?

I like to collect
information about
categories of things
(e.g. types of car,
types of bird, types
of train, types of
plants, etc.)

9.

Does your child use
simple gestures (e.g.
waves goodbye)?

He/she finds it
difficult to work out
what someone is
thinking or feeling
just by looking at
their face?

He/she finds social
situations easy?

I find it easy to work
out what someone is
thinking just by
looking at their face

10.

Does your child stare
at nothing with no
apparent purpose?

He/she finds it hard
to make new friends?

He/she finds it hard to
make new friends?

I find it difficult to
work out people’s
intentions?

8.

�APPENDIX 2
Table 6. Attribute selection results - child dataset
Algorithm
s vs.
selected
attributes
BayesNet
SimpleLo
gistic
DecisionS
tump
J48

ClassifierAttrib
uteEval

CorrelationAttrib
uteEval

4,9,8,10,1

4,9,10,8,6

ReliefAttribu
teEval

4,1,10,8,9

CfsSubset
Eval (BF
&amp;
Greedy)

1-10

LMT
RandomF
orest
REPTree

WrapperSubs
etEval (BF

Wrappe
r,
Greedy

1-10,17
3,4,6,7,10,12,
16

4,10
4,6,10,1
2,16

4

4

1,3,4,5,7,8,10
4,10
,14
4,6,10,12,16
15,7,8,9,10,15,
4,10
17
1,4,8,10
4,10

Table 7. Attribute selection results - adolescent dataset
Algorithm
s vs.
selected
attributes

ClassifierAttrib
uteEval

CorrelationAttrib
uteEval

ReliefAttribu
teEval

CfsSubset
Eval (BF
&amp; Greedy)

BayesNet
SimpleLo
gistic
DecisionS
tump
J48
LMT
RandomF
orest
REPTree

5,4,3,10,6

WrapperSubs
etEval (BF

Greedy

1,2,3,4,5,7,10
,12

1,3,4,5,1
0,12

3,4,5,10,14

5,4,10,3,6

5,4,10,3,6

5,3,10,4,8

3,4,5,6,7,8,
10,17

5
2,5,9,14
3,4,5,10,14

5,4,3,10,6

15,7,8,10,17

5,4,10,3,6

2,3,5,8

4,5,7,10

Table 8 Attribute selection results - adult dataset
Algorithm
s vs.
selected
attributes
BayesNet
SimpleLog
istic
DecisionSt
ump
J48
LMT
RandomFo
rest
REPTree

ClassifierAttribu
teEval

CorrelationAttrib
uteEval

ReliefAttribut
eEval

CfsSubset
Eval (BF
&amp;
Greedy)

9,6,16,19,7

WrapperSubs
etEval (BF

Greed
y

1-10,15,16
1,3,5,6,9,12,1
1,3,5,
5
6,9
9

9,6,8,7,19

9,6,5,4,3

5,9,6,4,7

1-10,16

1-5,710,16,15
1,3,5,6,9,12,1
5

9
1,5,9
1,3,5,
6,9

1-10,12,15,18,19
1,2,3,5,6,7,9,15

�APPENDIX 3
Table 9. Attribute selection accuracy - child dataset
Attribute
evaluator
ClassifierAttribu
teEval
CorrelationAttE
val
ReliefAttributeE
val
CfsSubsetEval
WrapperSubsetE
val (BF)
WrapperSubsetE
val (Greedy)

Bayes
Net
86.644
%
84.932
%
85.616
%
95.206
%
91.438
%
82.192
%

SimpleLog
istic

DecisionSt
ump

J48

LMT

78.082%

85.95
9%
84.93
2%
83.90
4%
91.43
8%
85.27
4%
82.19
2%

85.95
9%
84.58
9%
87.32
9%
98.97
3%
95.89
0%
83.90
4%

86.301%
84.589%
88.014%
98.973%
95.890%
82.192%

RandomF
orest
85.274%
81.849%
85.274%
93.151%
89.384%
84.247%

REPT
ree
85.616
%
85.959
%
83.219
%
83.562
%
84.589
%
83.562
%

Table 10. Attribute selection accuracy - adolescent dataset
Attribute
evaluator
ClassifierAttribu
teEval
CorrelationAttE
val
ReliefAttributeE
val

Bayes
Net

CfsSubsetEval

87.5%

WrapperSubsetE
val (BF)
WrapperSubsetE
val (Greedy)

88.462
%
77.885
%

90.385
%
89.423
%
82.692
%

SimpleLog
istic

DecisionSt
ump

89.423%
70.192%

J48

80.76
9%

81.731%
89.423%
86.539%
68.269%
78.846%

84.61
5%
72.11
5%
71.15
4%

LMT
89.42
3%
81.73
1%
88.46
2%
85.57
7%
78.84
6%

RandomF
orest

REPT
ree

86.539%
75%
85.577%
83.654%
86.539%

78.846%

76.923
%
68.269
%
72.115
%
71.154
%

Table 11. Attribute selection accuracy - adult dataset
Attribute
evaluator
ClassifierAttribu
teEval
CorrelationAttE
val
ReliefAttributeE
val
CfsSubsetEval
WrapperSubsetE
val (BF)
WrapperSubsetE
val (Greedy)

Bayes
Net
90.341
%
91.051
%
90.767
%
96.307
%
95.313
%
92.046
%

SimpleLog
istic

DecisionSt
ump

89.347%
90.909%
84.517%
90.625%
99.432%
95.881%
82.822%
93.04%

J48

LMT

88.21
%

89.35
7%
90.90
9%
90.62
5%
99.00
6%
95.02
8%
93.89
2%

89.06
3%
92.04
6%
89.34
7%
89.06
3%

RandomF
orest

REPT
ree

89.347%

87.5%

91.761%
90.057%
91.193%
94.46%
91.761%

90.199
%
89.205
%
86.364
%
89.063
%
89.921
%

�7. References

[1]

National Institute of Mental Health, Available at: https://www.nimh.nih.gov/health/topics/autism-

spectrum-disorders-asd/index.shtml
[2]

Lord, C., Rutter, M., &amp; Le Couteur, A. 1994. Autism Diagnostic Interview-Revised: A revised

version of a diagnostic interview for caregivers of individuals with possible pervasive developmental
disorders. Journal of Autism and Developmental Disorders, 24, 659–685
[3]

Lord, C, Risi, S, Lambrecht, L et al, 2000. The Autism Diagnostic Observation Schedule-Generic:

a standard measure of social and communication deficits associated with the spectrum of autism J Autism
Dev Disord. 2000; 30:205–223.7.
[4]

F. Thabtah, 2017. Autism Spectrum Disorder Screening: Machine Learning Approach and DSM-5

Fulfillment, ICMHI '17, May 20-22, 2017, Taichung City, Taiwan © 2017 Association for Computing
Machinery. ACM ISBN 978-1-4503-5224-6/17/05
[5]

American Psychiatric Association 2000. Diagnostic and statistical manual for mental disorder (4th

edn), Text revision. Washington, DC: American Psychiatric Association
[6]

American Psychiatric Association. 2013. Diagnostic and statistical manual of mental disorders:

DSM-5. Washington, D.C: American Psychiatric Association
[7]

C. Lord et al., 1994. Autism Diagnostic Interview-Revised: A Revised Version of a Diagnostic

Interview for Caregivers of Individuals with Possible Pervasive Developmental Disorders, Journal of Autism
and Developmental Disorders, Vol. 24, No. 5, 1994
[8]

L. D. Wiggins, J. Baio, C. Rice, 2006, Examination of the Time Between First Evaluation and First

Autism Spectrum Diagnosis in a Population-based Sample, Developmental and Behavioral Pediatrics, Vol.
27, No.2, April 2006.
[9]

Wall DP, Dally R, Luyster R, Jung J-Y, DeLuca TF (2012) Use of Artificial Intelligence to Shorten

the Behavioral Diagnosis of Autism. PLoS ONE 7(8): e43855. doi: 10.1371/journal.pone.0043855
[10]

D. Bone, M. S. Goodwin, M. P. Black, et al, 2014. Applying Machine Learning to Facilitate Autism

Diagnostics: Pifalls and Promises, Springer Science + Business Media New York 2014
[11]

F. Thabtah (2018): Machine learning in autistic spectrum disorder behavioral research: A review

and ways forward, Informatics for Health and Social Care, DOI: 10.1080/17538157.2017.1399132
[12]

UCI Machine Learning Repository, Available at: https://archive.ics.uci.edu/ml/index.php

[13]

Thabtah, F. (2017). ASDTests. A mobile app for ASD screening. www.asdtests.com

[14]

C.Allison, B. Auyeung, S. Baron-Cohen, 2012, Toward Brief “Red Flags” for Autism Screening:

The Short Autism Spectrum Quotient and the Short Quantitative Checklist in 1000 Cases and 3000 Controls,
Journal of the American Academy of Child &amp; Adolescent Psychiatry, DOI: 10.1016/j.jaac.2011.11.003
[15]

I.H. Witten, E. Frank (2005), Data Mining: Practical Machine Learning Tools and Techniques,

Elsevier
[16]

J.

Ross

Quinlan,

C4.5:

Programs

for

Machine

Learning,

Avilable

at:

https://link.springer.com/article/10.1007/BF00993309
[17]

University

of

Waikato,

Logistic

at:https://www.cs.waikato.ac.nz/ml/publications/2005/LMT.pdf

Model

Trees,

Available

�[18]

L.

Breiman

(2001),

Random

Forests,

Available

at:

https://www.stat.berkeley.edu/~breiman/randomforest2001.pdf
[19]

Towards Data Science, Cross-Validation in Machine Learning, K-Fold Cross Validation, Available

at: https://towardsdatascience.com/cross-validation-in-machine-learning-72924a69872f
[20]

Machine

Learning

Mastery,

An

Introduction

to

Feature

Selection,

Available

https://machinelearningmastery.com/an-introduction-to-feature-selection/
[21]

Weka – Version 3.8.3, Waikato Environment for Knowledge Analysis, ClassifierAttributeEval

at:

�</text>
                  </elementText>
                </elementTextContainer>
              </element>
            </elementContainer>
          </elementSet>
        </elementSetContainer>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26352">
                <text>Machine Learning in Autism Spectrum Disorder Diagnosis</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26353">
                <text>Naida Nalo, Jasmin Kevrić</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26354">
                <text>This paper represents an overview of Machine Learning techniques used in Autism Spectrum&#13;
Disorder - ASD diagnosis. ASD is detected based on behavioral screening which is time consuming and&#13;
can only be taken by a medical professional. The idea is to find a smaller number of features that are still&#13;
able to equally well provide satisfying results and not lose the accuracy, sensitivity nor specificity. Some&#13;
of the algorithms mostly used in recent studies were Artificial Neural Network - ANN and Alternating&#13;
Decision Trees - ADTrees. The researches usually use WEKA software package for applying the algorithm&#13;
and obtaining results.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26355">
                <text>Machine Learning, Autism Spectrum Disorder, diagnosis, features, ANN, ADTree,&#13;
WEKA.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="43">
            <name>Identifier</name>
            <description>An unambiguous reference to the resource within a given context</description>
            <elementTextContainer>
              <elementText elementTextId="26356">
                <text>2637-2835</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26357">
                <text>International Burch University, Sarajevo, Bosnia and Herzegovina</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26358">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26395">
                <text>January, 2020</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3455" public="1" featured="0">
    <fileContainer>
      <file fileId="4267">
        <src>https://omeka.ibu.edu.ba/files/original/c4bae5fbec77718447ec73dc65649ad6.pdf</src>
        <authentication>62737558fce1a681d66b8438c8f8fedb</authentication>
        <elementSetContainer>
          <elementSet elementSetId="4">
            <name>PDF Text</name>
            <description/>
            <elementContainer>
              <element elementId="52">
                <name>Text</name>
                <description/>
                <elementTextContainer>
                  <elementText elementTextId="26401">
                    <text>Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214

Handwriting digit recognition using Decision Tree Classifiers

Demir Korać1, Samed Jukić1, Mujo Hadžimehanović1
1

International Burch University, Sarajevo, Bosnia and Herzegovina
demir.korac@stu.ibu.edu.ba
samed.jukic@ibu.edu.ba
mujo.hadzimehanovic@stu.ibu.edu.ba

Abstract – The usage of handwritten character recognition has been useful for usage from large to
common consumer usage. The transitional period of the handwritten to the digital age can be largely
improved by focusing on perfecting handwritten character recognition. This paper and work aims to
focus on handwritten digit recognition using the decision tree classifier machine learning method,
implemented, trained and tested on the data set gathered from the Modified National Institute of
Standards and Technology dataset. The data to be recognized is inputted from a pre-existing reliable
set, used both for training and testing, in order to give a fair result. The system is run through a
Python script and the data set is stored in CSV format, preprocessed and ready for further usage.
Taking into consideration the size of the dataset (42000 rows of data), the system’s overall performance
is satisfactory with an accuracy of 85% and outputs the results in an understandable manner.
Keywords – character, decision, handwritten, recognition

1.

Introduction

Optical character recognition is the process of mechanically or electronically converting typed, printed or
handwritten images of text into a machine-encoded text. The source can be taken from a photo of a
document, a scanned document or a scene photo (billboards in a landscape photo). Another possible source is
superimposed text on an image, such as subtitle text from a television broadcast. This process of recognition
and the technique used in it is called a handwriting recognition system. In literature this recognition is
classified into offline handwriting recognition and online handwriting recognition depending on the style of
recognition. If an image of handwriting is previously acquired and recognized after it will be classified as
offline recognition. However if the handwriting is inputted directly into the machine to be recognized this is
called online recognition. One way of doing this is writing on a touchpad or other device dedicated for
writing and having it recognized. Another classification exists regarding the recognition and it is based on the
process of recognition itself and the technique.

We can recognize two main categories: segmentation free and segmentation based recognition. Segmentation
free recognition is based on recognizing the character without segmentation into smaller units or characters,

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
i.e. words into characters. Segmentation based recognition is the opposite of this. In this approach each word
is segmented into smaller units/characters and each character is recognized separately.

Handwriting has been for the most part of our history, the primary means of communication and information
organization, but with the modernization of these fields handwriting is becoming slowly obsolete. However
the legacy of handwritten information cannot be overlooked, so a transitional period and process needs to
occur.
That’s where handwriting and optical character recognition comes into play. The importance of handwriting
recognition is reflected in many industries such as:
health care (7,000 people are killed per year by the poor handwriting of doctors)
automotive (digital handwriting solutions allow drivers to write characters or numerals, or simply
gesture with their fingertips on vehicles' onboard computer screens instead of typing on a standard
keyboard)
field services (field service technicians use HWR technology for digital data capture, decreasing
paperwork and information loss while also allowing technicians to see precise notes on customer
history)
education (using HWR technology, students can benefit from more than just the increased
comprehension linked to taking handwritten notes. For example, handwriting recognition tech can
take your sloppy algebra equation, convert it into neat, digital text, and then crunch the numbers in a
matter of seconds. )
consumer (with the rapid success of tablets and smartphones, the market is desperately in need of an
alternative to inaccurate, digital keyboards and HWR tech is the best remedy.)

2. Previous work

Perwej Y. and Chaturvedi A. have worked on recognizing the English alphabet handwriting using Artificial
Neural Networks using binary pixels of the alphabet to train the Neural Network and the accuracy of this
method is 82.5% [1]. The data set that they worked on are handwritten English alphabet characters which are
scanned from documents and then “cleaned” and “smoothed”. The characters are then split into 25 segment
grids, scaling and thinning the segments of the characters to obtain skeletal patterns. This is then transformed
into binary values representing the segments input. (Shown in Figure 1.)

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214

Input Alphabet

Figure 1. Process of recognition
Amrouch M. and Y. Es saady have worked on a method using sliding window based on the Hough transform
as a feature extraction technique [2]. An inputted image is divided into two windows and then a dominant
direction is determined based on the Hough transform and a directional feature vector sequence has been
formed. This method is based on continuous HMMs and directional features with an average accuracy of
90.4%.The data set that they worked on is a database of Amazigh printed characters, containing 240 isolated
characters. The images of characters are in Gray level of dimensions 96x96 pixels.

M. Hanmandlu and O.V. Ramana Murthy [3] have done a study on recognition of Hindi and English
numerals. The numerals are represented in the form of exponential membership functions which serve as a
fuzzy model. They have achieved an overall recognition rate of 95% for Hindi numerals and 98.4% for
English numerals. The data set that they worked on is a database of 5000 samples of numerals for
handwritten English numerals. For Hindi they used a database of totally unconstrained handwritten numerals
created using the services of a large number of writers, since there is no standard database available at the
moment for handwritten Hindi numerals.

Nafiz Arica at al. has proposed a method of recognition without pre-processing which he believed leads to
loss of necessary information [4]. This has been backed up with a powerful segmentation algorithm with
utilization of character boundaries, maxima and minima, slant angle, upper and lower baselines, stroke height
and width and ascenders and descenders which improved the search algorithm of the optimal segmentation
path, applied on a gray-scale image. The dataset used was the handwritten database of LancesterOslo/Bergen, which contains single author cursive handwriting. In this dataset, 1,000 words with lower-case
letters are segmented and used for HMM training and another disjoint set of 2,000 words are used for testing
performance of the proposed system.
Table 1. Results of recognition for the LOB Dataset
TEST DATA
SIZE
LOB Dataset

2000

LEXICON SIZE
50

1000

30000

40000

92.3

90.8

89.1

88.8

The overall recognition rate of the whole system on word basis for various lexicon sizes is shown in the table
above.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
3. Method and Materials

The dataset being used in the project is the famous Modified National Institute of Standards and Technology
database of handwritten digits [5], as it is the most reliable and largest database of this type. This dataset is
taken as a subset of another larger dataset by NIST. The MNIST database of handwritten digits consists of
42000 rows of data. It will be split into a 80/20 ratio for training and test data respectively, so the training
data will have 33600 rows and test data will have 8400 rows of data. The digits have been size-normalized
and centered in a fixed-size image. The database and the files such as the training set images and labels, and
the test set images and labels can be found at: http://yann.lecun.com/exdb/mnist/

The machine learning model used will be a Decision Tree Classifier implemented through the scikit-learn
machine learning library in Python [6]. Decision Trees are used in data mining, machine learning and
statistics as a predictive modeling approach. The structure of trees is as follows: Class labels are represented
as leaves and the conjunctions of features that lead the class labels are represented as branches. The process
of operation will be such that the dataset will be inputted as a matrix containing cell inputs from the database
as intensity values of pixels of 28x28 images.
Table 2. Sample from dataset
label

pixel0

pixel1

pixel2

pixel3

pixel4

pixel5

pixel6

pixel7

pixel8

pixel9

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

4

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

7

0

0

0

0

0

0

0

0

0

0

3

0

0

0

0

0

0

0

0

0

0

5

0

0

0

0

0

0

0

0

0

0

3

0

0

0

0

0

0

0

0

0

0

8

0

0

0

0

0

0

0

0

0

0

9

0

0

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

3

0

0

0

0

0

0

0

0

0

0

3

0

0

0

0

0

0

0

0

0

0

1

0

0

0

0

0

0

0

0

0

0

2

0

0

0

0

0

0

0

0

0

0

An empty classifier will be created and using the fit method it will be filled with the training data. After the
classifier finishes with the training data, we will move on to the rest of the data set, the testing part. Using the
predict method we will output the classifier prediction of the handwritten digit from the test dataset along
with the actual image of the digit.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
4. Process
The process of recognition is as follows:
The dataset is imported and separated into two parts with a ratio of 80/20. The train set consists of 33600 and
the test set consists of 8400 rows of data.

Splitting the dataset:
traindata=data[1:33600,1:]
train_label=data[1:33600,0]

The data and its label are fitted to the model to learn from it. We do this by using the fit method and passing
our training set to it.

Fitting the training set:
clf.fit(traindata,train_label)

The user of the system gives an input of the order number he wants to test the system with, from the dataset.
That number is taken and a sample with that order number is chosen from the dataset, along with its actual
label.

Figure 2. Taking input from the user

Using the predict method the sample is predicted from the sample and the prediction is outputted. Using the
shape method we are taking the sample from a row vector and shaping it as a 28x28 matrix of pixel intensity
values. We are then creating a figure, which will be displayed after prediction.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
Using the shape method:
d.shape=(28,28)
When the figure is created, we are proceeding with the prediction. We will then be using the predict method
from the Decision Tree Classifier in the scikit-learn library [7].

Using the predict method along with output of the data:
print ("The predicted digit is =", clf.predict( [testdata[number]]))
print ("The actual digit is =", actnum)

Along with predicting the digit, we will also be outputting the actual label of the sample number that the user
chose. An extra feature added is the current certainty of prediction calculation.

Certainty of prediction:
p=clf.predict(testdata)
count=0
for i in range (0,8400):
count+=1 if p[i]==actual_label[i] else 0
print ("Certainty of prediction=", (count/8400)*100,"%")

For this feature we are taking all numbers in the range of the test data set and using the predict method, as to
calculate the success rate of our model on the current test data.

Figure 3. The systems’ process from the users’ perspective

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
5. Usage workflow
In Figure 4. We can see the workflow according to which the system will be operating.

Figure 4. Usage workflow

Since the system does not need dataset input from the user, we will just be importing the dataset. The system
needs user input, since the user will be choosing the digit that is to be recognized, so he needs to provide the
order number from the data. The system then outputs its prediction along with other data such as the actual
label and the prediction certainty. This allows the user to analyze the data given to him, and check if the
prediction was correct and is the prediction certainty satisfactory.

4. Decision Tree Classifier in scikit-learn

A decision tree is a flowchart-like tree structure where an internal node represents a feature (or attribute), the
branch represents a decision rule, and each leaf node represents the outcome. The topmost node in a decision
tree is known as the root node. It learns to partition on the basis of the attribute value. It partitions the tree in
a recursive manner called recursive partitioning. This flowchart-like structure helps you in decision making.
It's visualized like a flowchart diagram which easily mimics the human level thinking. That is why decision
trees are easy to understand and interpret.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214

Figure 5. Decision tree operation diagram
5. Results

After processing and thorough testing the model seems to have a success rate of ~85% on the test dataset
given to it. The system itself may have some confusion when analyzing similar digits. This occurs because
the digits themselves are handwritten and because the data has been gathered from over 250 different writers,
and some of them write different digits in a similar manner. An example is shown in Figure 6. where we can
see that the system is having trouble predicting number 9, but instead predicts 3. From the image we can see
that the digits themselves are quite similar.

Figure 6. Wrong prediction of digit 9 as digit 3

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020214
Different parts of the world write digits differently, so the algorithm would be much more accurate if it
would be implemented based on location, and also, if different Machine Learning methods or Ensemble
methods would be used, we would surely improve accuracy. Finally the thing that would take the recognition
to the next level is image quality, considering that this algorithm has been implemented on a dataset with
quite low quality of images.

5. Conclusion

It should be noted that this data set is not sufficient for a higher level model, to be used in production, but as
its using a small dataset, used for initial training, it has shown to be a pretty successful model and method.
If we would train the aforementioned algorithm on a larger dataset with higher image quality, it could be
used for real world projects. The confusion of the system may also be affected by the quality of the images in
the given dataset, as it is limited to only 28x28 matrices. Future training with a dataset with matrices of larger
magnitude may prove more successful.

REFERENCES
[1]

Perwej Y. &amp; Chaturvedi A. -Neural Networks for Handwritten English Alphabet Recognition.

2011 IJCA
[2]

M. Amrouch, A. Rachidi, M. El Yassa, D. Mammass - Printed amazigh character recognition by a

hybrid approach based on Hidden Markov Models and the Hough transform. 2009 IEEE https://scikitlearn.org/stable/modules/generated/sklearn.tree.DecisionTreeClassifier.html
[3]

M. Hanmandlu, O.V. Ramana Murthy - Fuzzy model based recognition of handwritten numerals.

2007 Pattern Recognition
[4]

N. Arica, F.T. Yarman-Vural - Optical character recognition for cursive handwriting. 2002 IEEE

[5]

MNIST handwritten digit database - http://yann.lecun.com/exdb/mnist/

[6]

Decision Tree Classification in Python -

https://www.datacamp.com/community/tutorials/decision-tree-classification-python
[7]

sklearn.tree.DecisionTreeClassifier-

�</text>
                  </elementText>
                </elementTextContainer>
              </element>
            </elementContainer>
          </elementSet>
        </elementSetContainer>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26344">
                <text>Handwriting digit recognition using Decision Tree Classifiers</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26345">
                <text>Demir Korać, Samed Jukić, Mujo Hadžimehanović</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26346">
                <text>The usage of handwritten character recognition has been useful for usage from large to&#13;
common consumer usage. The transitional period of the handwritten to the digital age can be&#13;
largely improved by focusing on perfecting handwritten character recognition.&#13;
This paper and work aims to focus on handwritten digit recognition using the decision tree&#13;
classifier machine learning method, implemented, trained and tested on the data set gathered from&#13;
the Modified National Institute of Standards and Technology dataset.&#13;
The data to be recognized is inputted from a pre-existing reliable set, used both for training and&#13;
testing, in order to give a fair result. The system is run through a Python script and the data set is&#13;
stored in CSV format, preprocessed and ready for further usage.&#13;
Taking into consideration the size of the dataset (42000 rows of data), the system’s overall&#13;
performance is satisfactory with an accuracy of 85% and outputs the results in an understandable&#13;
manner.&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26347">
                <text>character, decision, handwritten, recognition</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="43">
            <name>Identifier</name>
            <description>An unambiguous reference to the resource within a given context</description>
            <elementTextContainer>
              <elementText elementTextId="26348">
                <text>ISSN 2637-2835</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26349">
                <text>International Burch University, Sarajevo, Bosnia and Herzegovina</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26350">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26394">
                <text>January, 2020</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3454" public="1" featured="0">
    <fileContainer>
      <file fileId="4266">
        <src>https://omeka.ibu.edu.ba/files/original/b07ea9e57cba6ae337bfd4515469085d.pdf</src>
        <authentication>1aead1a15145473c159b69bbe91031a6</authentication>
        <elementSetContainer>
          <elementSet elementSetId="4">
            <name>PDF Text</name>
            <description/>
            <elementContainer>
              <element elementId="52">
                <name>Text</name>
                <description/>
                <elementTextContainer>
                  <elementText elementTextId="26400">
                    <text>Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213

Biometrics Based Access Control System

Mujo Hadžimehanović1, Dino Kečo1, Demir Korać1
1
International Burch University, Sarajevo, Bosnia and Herzegovina
mujo.hadzimehanovic@stu.ibu.edu.ba
dino.keco@ibu.edu.ba
demir.korac@stu.ibu.edu.ba
Abstract – Access control includes attendance checking and intrusion prevention. It is used to protect
property, employees and other assets of a company or institution. Since attendance checking and
intrusion detection are important segments of many educational institutions and other businesses as well,
it is important to make these processes faster, easier and as convenient as possible. Lots of institutions
are suffering from unreliable attendance checking methods, so we have decided to use biometrics, more
precisely face recognition to automate and improve this overall process. As part of this study the full
system has been implemented for recognition of people. As an example of usage in an educational
institutions multiple photos will be recorded during the class session, so that in case of students leave
class after the first shot, they will be removed from the attendance sheet. All recognized people will be
stored in Mongo database as an array of features and later read from database and processed by using
Python script for face recognition. All educational institutions are going to have benefits from this study.
Benefits would be improving attendance management and security.
Keywords - attendance, face, images, recognition
1.

Introduction

Technology is rapidly improving nowadays and everyday activities are adopting these improvements. Point is
to automate these activities and not to lose time performing them. Attendance is a really important part in most
organizations such as schools, faculties, companies etc. Today, attendance is performed in various ways. Best
way to do it is biometrics. Biometrics is a bioengineering area which is an automated method for person
recognition based on its physiological or behavioural characteristics. There are many biometric templates such
as fingerprints, face, hand geometry, iris, voice or signature. System is going to use face biometric template
because it is the fastest approach and requires no human intervention. This method is better than other biometric
methods because these methods are time consuming. There are also lots of systems which are using RFID
cards, location based attendance tracking systems, signature based etc. Negative things about these methods
are that they can be faked. In RFID and location based systems employees are carrying RFID cards or GPS
locators. So, other people can check instead of other employees. There are two main stages of face recognition
process and they are face detection and face identification [1]. Recording employees' work hours and their
activities, attendance of students in schools are really important components of every company or school. This
process is maintained by using signature, fingerprint, iris, RFID or face recognition. System is going to use a
camera which captures images of people entering the company or school building. Detected faces will be
compared with pictures which are already in the database. If a person's picture is in the database attendance

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
will be checked automatically. Otherwise, the security system is going to be informed that an unrecognized
person entered the building. So, this system can also be used as an intrusion detection system.

2.

Literature Review

Following section contains a presentation of all related work and their methods.
Since they have large-scale data with massive noisy labels, X. Wu, R. He, Z. Sun, and T. Tan used a Light
CNN framework [2]. Their Light CNN architecture contains Max-Feature-Map to suppress low activation
neurons in all layers. Their model was trained on Celeb-1M dataset. In order to handle noisy labeled images,
they proposed a semantic bootstrapping method to automatically re-label training data via pre-trained deep
networks. For training purposes they used five types of databases. First type is commonly used Labeled Faces
in the Wild which consists of ~13,250 images of ~5,750 people. At VR@FAR=0 for Light CNN-29, they
achieved 97.50%, while results from all other methods were lower than 70%. Next type of the database are
collections of images extracted from Youtube videos which contain YouTube Celebrities (YTC) and
Celebrity1000 database. Precision achieved for these datasets is 94.18%. Third type are MegaFace, IJB-A and
IJB-B datasets which are challenging and they got 85.13% precision. Cross-domain databases are the fourth
type of database. It includes CACD - VS, Multi - PIE and the CASIA NIR - VIS 2.0 database. They achieved
98.55% on CACD, 95.0% on Multi-PIE and on CASIA VR@FAR=0.1% result is further improved from
94.03% to 94.77%.
M. Arsenovic, S. Sladojevic, A. Anderla, and D. Stefanovic use Convolutional Neural Networks (CNNs)
cascade to detect faces and CNN to generate embeddings of each face [3]. Fact is the best results for larger
datasets are achieved by using CNNs, but in their production environment that was not the case. CNN gave the
best results for smaller datasets. Accuracy of 95.02% was achieved on a dataset created by authors in the realtime environment. Five employees of the company took pictures of themselves and they used these pictures as
a dataset. Model was trained with these 5 pictures.
Active annotation and learning framework was used by H. Ye et al [4]. They are starting with face image
training set without labels and train a deep neural network iteratively model created was used to choose
examples for further manual annotation. After following active learning strategy, Value of Information
criterion is derived to actively select candidate annotation images. This model reaches the coverage of 70.7%
with a precision of 95%.
MSR Image Recognition Challenge by J. Li et al introduces a knowledge base which has an idea to assign each
face unique entity key and provide large dataset consisting of about 100,000 famous persons with around 100
images per person (MsCeleb) [5]. Method achieved coverage of 46.1% at 95% precision on the random dataset
and 33% at 95% precision on the hard set of their challenge. Authors proposed a method consisting of two
stages to learn robust human face representations for effective recognition of human faces. First stage in the
training set is cleaning the noisy data because dataset is taken from the internet so images without faces can
appear. In order to do so, a deep neural network was trained on existing dataset.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
S. Chintalapati and M. V. Raghunadh used SVM and Bayesian classifier for automated attendance system
based on face detection and recognition [6]. They proved that these classifiers are better when compared to
other distance classifiers. This system automatically detects the student which enters the classroom and marks
the attendance if recognizes him. One of the failures of the system is recognizing faces only up to 30 degrees
angle variations.

3.

Methods and Materials

Dataset to be used is Labeled Faces in the Wild [7] which is a database of face photographs designed for
studying the problem of unconstrained face recognition. It contains more than 13,000 pictures collected from
the web. Each image has been labeled with the name of the person on it. There are 1680 different persons in
images. Fig.1 shows samples from LFW dataset.

Fig.1. Samples from dataset
These images need to be processed in order to get numerical representation of faces which is called feature
vector. Feature vector consists of various numbers in a specific order which can be: height or width of face,
width of lips, nose height etc… Final output of processed image needs to be an array with features which is
shown in Fig.2. All features are stored in the mongo database for speed improvement. Python script iterates in
a folder which has dataset images and stores one by one in a database with image name and features. Face
Recognition library with deep learning is going to be used for this project. Deep learning model has an accuracy
of 99.38% on Labeled Faces in Wild dataset. Features of face recognition library are finding faces in pictures,
finding facial features in pictures and identifying faces in pictures. Once installed face recognition gives us two
command line programs:
●

face_recognition - recognize face on image

●

face_detection - detect face in image

Face recognition process consists of two stages. This includes taking and preparing training dataset and
integration into existing system. For testing purposes, data was collected at the university. These are images of
students which were taken in the first year. Images are preprocessed and inserted into the mongo database by

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
using python script for inserting images. After insertion, facebook images of the same people were taken and
tested by using python script for face recognition.

Fig.2. Array of features
3.1

Data preprocessing

Implemented system is going to use monitoring cameras at the entrance. It means that we could have some
kind of network or other problems and taken images could be blurry, so we have to include such images in
training dataset, Fig.3. Persons entering the building can be photographed from different angles, so these kind
of photos should be included also in training dataset, Fig 4.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213

Fig.3. Original and Blurred image
Fig.3 shows an example of original and blurred images. If we have perfect conditions we would have a picture
like the original one, but if the system experiences network issues we might have a blurry image like the one
represented. System has to be ready to respond accordingly to these kinds of issues. Python script was written
using OpenCV [8] interface to generate blurry images out of the original ones.

Fig.4. Image from front and side
In Fig.5 we can see facial features drawn on picture of Pep Guardiola. Most important features are shown: eyes,
nose, mouth and chin location. These features are used when recognizing people on images. Face recognition
library contains script face_landmarks for detecting facial landmarks and positioning the face based on them.

Fig.5. Facial features

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
3.2

Usage Workflow

Application usage workflow is represented on Fig.6. Images need to be collected into a single folder, so that
insertion helper scripts can be run. For multiple insertions we need to pass a folder of images to script, which
iterates through these images, creates encodings and inserts them into database. Single insertion script accepts
an image as a parameter, encodes it and inserts into a database. Last step is recognizing images, the script
accepts an image which needs to be recognized and iterates through mongoDB collection and looks for
matching images.

Fig.6. Usage workflow
3.3

Face Recognition Library

Face recognition library which we are using is built using dlib’s face recognition with deep learning. We can
install library by using a python package installer. Once installed we are provided with two command-line
programs : face_recognition and face_detection. Face recognition recognizes faces in a photograph or in a
folder of photos. There should be two folders, one containing known people and second which contains photos
of people which we want to recognize. Face recognition program is run with two parameters which are the
names of these folders. Face detection program finds pixel coordinates of faces. It takes a folder with images
as parameter and at the end prints one line for each face that was detected. There is also an option to speed up
the overall process by doing a recognition with multiple CPU cores. For example if we have 8 core CPU, we
can process 8 times as many images in the same amount of time.
Dlib is a toolkit written in C++ and contains ML algorithms and tools for solving real world problems. The
most important thing is that dlib is an open source library which enables anyone to use it anywhere, free of any
charge. Some of the dlib’s features are Deep Learning, Multiclass SVM, Image Processing etc.. Our library
uses Image Processing tool for face recognition built by using deep learning tools from dlib.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
3.4

Helper Scripts

There are multiple helper scripts which enable user to insert multiple or single image into database and
recognize faces. The most important parts of scripts are represented in the following lines.
insertMultiple.py
for image in images:
current_image = face_recognition.load_image_file("images/" + image)
encodings = face_recognition.face_encodings(current_image)
if len(encodings) &gt; 0:
current_image_encoded = encodings[0]
num_of_images+=1
else:
print("No faces found in the image " + image)
num_of_not_found+=1
continue
mydict = { "image": image, "encoding": current_image_encoded.tolist() }
x = mycol.insert_one(mydict)
print(image + " inserted")

Code snippet above loads images from the folder, encodes them and inserts them into mongoDB. We have to
provide the name of the folder which contains images and simply run the script.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
faceRecognition.py
unknown_image = face_recognition.load_image_file("image.jpg")
unknown_face_encoding = face_recognition.face_encodings(unknown_image)[0]
known_faces = []
names = []
for x in mycol.find():
known_faces.append(x['encoding'])
names.append(x['image'])
results = face_recognition.compare_faces(known_faces, unknown_face_encoding)
for x in range(len(known_faces)):
if results[x] == True:
print("Recognized: " + names[x])
else:
print("Failed: " + names[x])

Code represented above does face recognition. It takes an image of the person which we want to recognize,
encodes it, loops through face encodings from the database and checks if a person exists in the database.

4.

Results

By using a custom dataset which was collected from our university. Students' images were taken and tested on
created scripts. From these tests we have obtained accuracy of 90.9% when testing on images found on
Facebook. There were some problems when recognizing people from different angles, but this can be material
for further study. Images of people are not shown because they did not agree to publish their images.
Since face_recognition python library has a pre-trained model there is no need for additional training.
Improvement is that all images are inserted in mongoDB with image name and face encodings array. Fig.5
shows one part of mongoDB record. Python script for inserting images in mongoDB is written and it takes a
folder with images and inserts one by one in the database. In our testing environment LFW dataset is used and
all images are collected into a single folder. Number of images inserted in the database is 4014. There are also
images on which faces are not recognized. Unrecognized images number is 21. So, if we take into consideration
that 4014 images are inserted and 21 are unrecognized which means that more than 99% images were
recognized. Example of such an image is shown in Fig.6. Execution time of the script is about 35 minutes for
LFW Dataset.
Final result of this research would be access control application. Application can be installed on Raspberry Pi
which has a camera installed. All assets that are necessary for access control application to be fully functional
can be installed on Raspberry. These are mongoDB, python and python libraries. Overall process is not so
challenging, so that we do not need anything better than Raspberry Pi 3 B+ which is a model that we used
while testing the application.

�Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020213
5.

Conclusion

The aim of this study was to make the attendance checking process a lot easier for companies and schools by
using biometrics. Every employee or student would be recorded by camera at the entrance and recorded in the
system. For educational institutions cameras will be installed in classrooms so that the system can make
multiple shots during lessons. This research was successful because it made the recognition process faster by
using a document based database which is really fast.

This study will bring benefits for multiple groups. Benefit for schools is easier attendance recording and
reducing waste of time at the beginning of the classes. Also students will not have a chance to avoid coming to
classes because this system will not allow them to cheat. Similar benefit is for companies to track their
employees coming and leaving time. Future research suggestions in this field are solving problems if a person
is recorded from the side and possibly getting blurry images because of internet connection issues.

REFERENCES
[1]

B. T. Liyew and P. Hazari, “A Survey on Face Recognition based Students Attendance System.”

[2]

X. Wu, R. He, Z. Sun, and T. Tan, “A Light CNN for Deep Face Representation With Noisy
Labels,” IEEE Transactions on Information Forensics and Security, vol. 13, no. 11. pp. 2884–2896,
2018, doi: 10.1109/tifs.2018.2833032.

[3]

M. Arsenovic, S. Sladojevic, A. Anderla, and D. Stefanovic, “FaceTime — Deep learning based
face recognition attendance system,” 2017 IEEE 15th International Symposium on Intelligent Systems
and Informatics (SISY). 2017, doi: 10.1109/sisy.2017.8080587.

[4]

H. Ye et al., “Face Recognition via Active Annotation and Learning,” Proceedings of the 2016
ACM on Multimedia Conference - MM ’16. 2016, doi: 10.1145/2964284.2984059.

[5]

J. Li et al., “Robust Face Recognition with Deep Multi-View Representation Learning,”
Proceedings of the 2016 ACM on Multimedia Conference - MM ’16. 2016, doi:
10.1145/2964284.2984061.

[6]

S. Chintalapati and M. V. Raghunadh, “Automated attendance management system based on face
recognition algorithms,” 2013 IEEE International Conference on Computational Intelligence and
Computing Research. 2013, doi: 10.1109/iccic.2013.6724266.

[7]

“LFW Face Database : Main.” [Online]. Available: http://vis-www.cs.umass.edu/lfw/. [Accessed:
19-Jan-2019].

[8]

“OpenCV library.” [Online]. Available: https://opencv.org/. [Accessed: 09-Jan-2019].

�</text>
                  </elementText>
                </elementTextContainer>
              </element>
            </elementContainer>
          </elementSet>
        </elementSetContainer>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26336">
                <text>Biometrics Based Access Control System</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26337">
                <text>Mujo Hadžimehanović, ​Dino Kečo,​ ​Demir Korać​</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26338">
                <text>​Access control includes attendance checking and intrusion prevention. It is used to protect&#13;
property, employees and other assets of a company or institution. Since attendance checking and&#13;
intrusion detection are important segments of many educational institutions and other businesses as&#13;
well, it is important to make these processes faster, easier and as convenient as possible. Lots of&#13;
institutions are suffering from unreliable attendance checking methods, so we have decided to use&#13;
biometrics, more precisely face recognition to automate and improve this overall process. As part of&#13;
this study the full system has been implemented for recognition of people. As an example of usage in&#13;
an educational institutions multiple photos will be recorded during the class session, so that in case&#13;
of students leave class after the first shot, they will be removed from attendance sheet. All&#13;
recognized people will be stored in Mongo database as an array of features and later read from&#13;
database and processed by using Python script for face recognition. All educational institutions are&#13;
going to have benefits from this study. Benefits would be improving attendance management and&#13;
security.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26339">
                <text>Keywords - attendance, face, images, recognition</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="43">
            <name>Identifier</name>
            <description>An unambiguous reference to the resource within a given context</description>
            <elementTextContainer>
              <elementText elementTextId="26340">
                <text>2637-2835</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26341">
                <text>International Burch University, Sarajevo, Bosnia and Herzegovina</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26342">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26393">
                <text>January, 2020</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3449" public="1" featured="0">
    <fileContainer>
      <file fileId="4265">
        <src>https://omeka.ibu.edu.ba/files/original/a7be93f106bd4655fce2c143192ca565.pdf</src>
        <authentication>671002f6cf69f8a0bc016d089e362b6e</authentication>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26328">
                <text>Transient Stability Analysis on Modified IEEE 14-Bus System&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26329">
                <text>Ermin Šunj, Ammar Arpadžić, Mirza Šarić</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26330">
                <text>Power system stability represents an important condition of the safe and efficient operation of&#13;
the electric power system. This paper presents the transient stability analysis for the case of the new&#13;
generator grid connection. The analysis is performed on a modified IEEE 14-Bus test system. In total,&#13;
two cases were analysed. In the first case, the analysis with the maximum installed power of the generator&#13;
(250 MW) is carried out, while in the second case, the analysis is performed using the optimal generator&#13;
ratings (75 MW) using the DIgSILENT PowerFactory software. The transient stability analysis was&#13;
carried out under the three-phase symmetrical faults and the N-1 criterion requirements. The results&#13;
indicate that power system large disturbances significantly influence system operation and&#13;
characteristics. This paper demonstrates the importance of transient stability analysis, which is an&#13;
important part of power system studies and must be included in generator grid connection approval.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26331">
                <text>active power, IEE 14-Bus system, N-1 criterion, reactive power, rotor angle, transient&#13;
stability</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="43">
            <name>Identifier</name>
            <description>An unambiguous reference to the resource within a given context</description>
            <elementTextContainer>
              <elementText elementTextId="26332">
                <text>2637-2835</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26333">
                <text>International Burch University, Sarajevo, Bosnia and Herzegovina</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26334">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26392">
                <text>January, 2020</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3448" public="1" featured="0">
    <fileContainer>
      <file fileId="4262">
        <src>https://omeka.ibu.edu.ba/files/original/cda47db38a0256d84f45336ef7e1fa5e.pdf</src>
        <authentication>9d17336775d02b9918c005099df3eb63</authentication>
        <elementSetContainer>
          <elementSet elementSetId="4">
            <name>PDF Text</name>
            <description/>
            <elementContainer>
              <element elementId="52">
                <name>Text</name>
                <description/>
                <elementTextContainer>
                  <elementText elementTextId="26399">
                    <text>Journal of Natural Sciences and Engineering, Vol. 2, (2020)
DOI number: 10.14706/JONSAE2020211

Impact of Electric Vehicles in a Grid-to-Vehicle Mode on Voltage Stability

Naida Nalo1, Emina Kišija1, Mirza Šarić1
1

International Burch University, Sarajevo, Bosnia and Herzegovina
naida.nalo@stu.ibu.edu.ba
emina.kisija@stu.ibu.edu.ba
mirza.saric@ibu.edu.ba

Abstract — With a rapid development and a massive deployment of electric vehicles, the power system
is facing many challenges regarding power quality and voltage stability. This paper deals with the impact
of electric vehicle in grid-to-vehicle mode depending on different EV penetration levels and point of
connection on static voltage stability impact of a real low-voltage distribution network. Based on nine
variations created, results showed that connecting vehicles closer to the beginning of the feeders creates
a smaller voltage drop, therefore more vehicles can be connected. However, going farther from the feeder
causes voltage to go below 0.9 p.u. and eventually leads to instability.
Keywords—electric vehicles, grid-to vehicle, static voltage stability.

1. Introduction

Power system stability can be defined as the ability of the system to remain in an equilibrium state under normal
operating conditions and to regain that equilibrium state after being subjected to a physical disturbance [1].
According to [2] power system stability is defined as a term applied to alternating – current electric power
systems, denoting a condition in which the various synchronous machines of the system remain in synchronism,
or „in step“, with each other. On the opposite side, instability is defined as a condition involving loss of
synchronism or falling „out of step” [2]. However, instability can also occur without the loss of synchronism
[1].
Electric vehicles (EVs) are considered to be a promising solution both for reducing air pollution and also as
being introduced as a new form of distributed generation when working in a vehicle-to-grid (V2G) mode. Many
countries are offering incentives and by doing so, motivating EV owners to charge their vehicles in scheduled
times, to help flatten the daily peaks [3].
In the stability calculations, the behavior of the system under the effect of a transient disorder is of interest.
Equipment reacts as a system response to a disorder. In each situation, only part of the protection reacts,
therefore, the problem must be simplified and the key factors for each type of instability must be defined.

�In this project, the problem of voltage stability is investigated in a LV distribution network involving 46 buses
and 42 loads. According to different penetration levels of (EVs) and point of connection across the network,
nine scenarios were modeled and examined for voltage instability.
1.1 Voltage stability
Voltage stability is the ability of the system to maintain acceptable voltage values on all busbars in the system,
both in normal operating conditions and after the effects of the disruption. Voltage instability occurs when a
disorder, which can be caused by an increase in consumer demand or a change in operating conditions, causes
a progressive and uncontrollable voltage drop. The main cause of voltage instability is the inability of the
system to respond to reactive power requirements. The core of the problem is usually a decrease in the voltage
in the flow of active and reactive power through inductive reactances representing the transmission network
[1].
The criterion for voltage stability is that on all busbars in the system, under certain operating conditions, the
bus voltage is increasing as injection of reactive power on the same buses is increasing. Therefore, the system
is unstable if voltage level is decreasing as reactive power is increasing, at least on one busbar in the system.
Voltage stability is a local phenomenon, but its consequences may have a widespread impact.
A voltage collapse is far more complex than voltage instability and according to [4], can be explained as an
inability of the power system to supply the reactive power or as an excessive absorption of reactive power by
the system itself. It can also be defined as a process in which voltage instability causes very low voltage levels
in a substantial part of the system. A local voltage collapse can and will lead to a widespread collapse of the
power system [4].
2. Literature review

Electric vehicles have experienced a warm welcome by pollution-aware society. Their non-polluting nature
helped them gain popularity and become one of the most sold cars in Norway, according to the Norwegian
Road Federation [5]. However, deploying large fleets of electric vehicles impacts the load profile of the
network since EVs are introduced as additional loads when being connected for charging [3].
A study in [6] focused on the static voltage stability impact of EV charging stations. A cluster load model
equivalent to 20 sets of EV chargers was taken into consideration along with the different probability
distributions of state of charge (SOC) of the batteries. The authors concluded that charging stations are most
likely to cause voltage instability due to the variableness of power during the charging process.
Research conducted in [7] focused on the static voltage stability of plug-in EVs with respect to different
charging methods. The results showed that voltage stability is closely related to the proportion of the constant
impedance and the constant power load. Since EVs were considered as constant power load, the less the initial
voltage drop percentage, the more EVs will be allowed to access the distribution network.

�Authors of [8] investigated the power quality and dynamic stability aspects of vehicle to grid connection of
EVs which uses a bidirectional power flow and allows the EVs to give back to the grid if needed. Their
conclusion was that charging and discharging state of the PEV does not affect negatively neither the voltage
stability nor frequency since they remain within allowed limits.
A study in [9] investigated the impact of high PV penetration in a low-voltage distribution network on voltage
stability. In the paper, PV curves were used to analyze the static voltage stability in a test node of an important
and possibly critical line. It was shown that the node situated near the end of the network had the weakest PV
characteristics due to power loading and the distance from the feeder. However, they concluded that integrating
photovoltaic units with 40% penetration level would optimize the voltage stability of that node.
Research conducted in [10] analyzes voltage stability with aid of PV curves on an example of a real
transmission network. EVs included in the study were all charged during the daily peak load with a six-hour
charge time. Results showed that high levels of EV penetration, with the expected annual increase, leads to
unacceptably large voltage variations.
A new method for analyzing the impact of PEVs in distribution networks was proposed in [11]. As in many
other papers, this study confirmed that a small number of vehicles does not create stability issues whereas a
large fleet of vehicles causes a greater effect on the grid. Charging strategies as the overall conclusion was
highlighted in this paper.
A study in [12] investigated an impact of EV charging on voltage variations and unbalance in a real low voltage
distribution network. Different scenarios were created to depict several EV penetration levels and load
distributions across the network. Results confirmed the work of other papers, showing that point of connection
plays an important role to the level of impact of EVs to two analyzed power quality parameters.
An analysis, similar to [9], will be conducted in this paper, using PV curves to determine the critical busbars
along the two feeders of the low-voltage distribution network. Section III explains the modeled network and
created variations, Section IV draws results and Section V draws conclusions.
3. Methods
3.1 Problem formulation
Electric vehicles in this project are treated as single-phase loads connected to the network, in a grid-to-vehicle
(G2V) mode. Because of the increasing number of EV charging stations being integrated to the power system,
analysis of their clustering effect and influence on the static voltage stability have become important and
necessary. In this project, analysis of an impact of EV charging on voltage stability is performed on a real
example of a part of a distribution network.

�3.2 P-V Curves
In voltage stability studies, characteristics of interest are the relationships between transmitted power P,
receiving end voltage V and reactive power injection Q. P-V and Q-V curves are traditional forms of displaying
these relationships. In this project, P-V curves are analyzed. Power-Voltage analysis process includes
increasing transfers of power (MW) and monitoring what happens with voltages in the system. This is done by
increasing the power system load and, at each increment, power flows are recomputed (P-V curve is non-linear
and full power flow solutions are required) until the nose of the PV curve is reached, that is, the maximum
transferred power [13]. That point represents the critical voltage because after that, rapid decline of voltage
occurs. Therefore, reaching maximum power is highly avoided because operating at or near stability limit risks
a large – scale blackout. Power margin between the current operating point and critical voltage is used as
voltage stability criterion [14].
3.3 LV Distribution Network Modeling and Variations
In this paper, the analysis was done using the model of 46 – bus LV distribution network, with total of 42 loads
distributed along two feeders, modeled in DIgSILENT Power Factory. Modeled network is provided in
Appendix 1. Length of the first feeder is 371 m while the length of the second feeder is 253 m. Nine variations
were modeled, including: low, medium and high EV penetration levels, at the beginning, in the middle and at
the end of the network. List of variations is provided in Table 1. Numbers of EVs included in each variation
are presented in Table 2 and Table 3.
Table 1. Network variations (penetration-point of connection)
Case no.

Network Variations
1.1 Low-beginning
1.2 Medium-beginning
1.3 High-beginning
2.1 Low-middle
2.2 Medium-middle
2.3 High-middle
3.1 Low-end
3.2 Medium-end
3.3 High-end

1

2

3

Table 2. Number of EVs in Variations
Network Variations
Variati
on No.
1.1
2.1
3.1
1.2
2.2
3.2

Penetration
Level

Number of
EVs

Percentage
of
penetration
level

Low

6

≈15%

Medium

12

≈30%

�Table 3. Number of EVs in Variations cont’d
Network Variations
Variati
on No.
1.3
2.3
2.4

Penetration
Level

Number of
EVs

Percentage
of
penetration
level

High

21

50%

4. Results
4.1 Case 1 - EVs distributed at the beginning
First three variations were modeled so that electric vehicles are placed near the beginning of the two feeders.
Each variation had a different penetration level of EVs as explained in Table 1. After the load flow calculation
was performed, Transmission Network Toolbox was activated, and PV curves were calculated. To see which
busbars stay within the allowed limits and which do not, several busbars were selected from the beginning,
middle and end of each of two feeders and included in the resulting PV graph. The obtained graphs for Case 1
variations are presented in Figure 1, Figure 2 and Figure 3, respectively.

Fig. 1. PV curves for Variation 1.1

�Fig. 2. PV curve for Variation 1.2

Fig. 3. PV curve for Variation 1.3
All busbars whose PV curves are above the voltage value of 0.9 p.u. are acceptable and stable, while those
below 0.9 p.u. are not stable and therefore not acceptable. As presented in the graphs, two busbars, plotted in
blue and grey, have values below 0.9, which makes them unstable. These two busbars are from the first feeder,
situated in the middle and at the very end of the feeder. All busbars from the second feeder stayed within
allowed limits, as well as the busbar from the beginning of the first feeder.

�4.2 Case 2 – EVs distributed in the middle
Three variations examined for the impact of EV charging and placement around the middle of the two feeders
were 2.1, 2.2 and 2.3 Number of EVs connected to the feeders are with respect to Table 1. Several busbars
were selected and included in resulting PV graph, to depict the voltage stability across the two feeders, that is,
to show how stable are busbars from the beginning, middle and end of the two feeders. Results for the
abovementioned variations are shown in Figure 4, Figure 5 and Figure 6.

Fig. 4. PV curve for Variation 2.1

�Fig. 5. PV curve for Variation 2.2

Fig. 6. PV curve for Variation 2.3
According to results obtained from the PV graph, conclusions similar to those in previous variation can be
drawn. All busbars from the second feeder and only the busbar from the very beginning of the first feeder stay
within allowed limits of stability, that is above 0.9 p.u. value of voltage, shown in the y-axis. Two busbars from
the middle and at the end of the first feeder show instability.

�4.3 Case 3 – EVs distributed at the end
Last three variations from Figure 2 were modeled to investigate how much EVs connected near the end of the
feeders will affect voltage instability of selected busbars across the two feeders. Number of connected vehicles
per variation is shown in Table 1. Selected busbars remained the same as those used in the previous six
variations. Results obtained are shown in Figure 7, Figure 8 and Figure 9.

Fig. 7. PV curve for Variation 3.1

�Fig. 8. PV curve for Variation 3.2

Fig. 9. PV curve for Variation 3.3
Results from the obtained PV graph of the last tested variations show that voltages of the two terminals from
the middle and end of the first feeder experience a drop below 0.9 p.u. but gets a more constant value when
compared to results of previous variations. All busbars from the second feeder and only one from the very
beginning of the first feeder have values greater than 0.9 p.u., making them well within allowed limits of
voltage stability.

�5.

Conclusion

The purpose of this paper was to analyze the impact of different EV penetration and points of connection on
voltage stability of a distribution network. Load flow analysis was performed on all nine scenarios followed
by a PV curve calculation in Transmission Network Toolbox of DIgSILENT. Then, a static voltage stability
analysis was performed using PV curves for a number of selected busbars from the beginning, middle and end
of the two feeders. The criterium was that all curves above 0.9 p.u. value of voltage were acceptable, and all
below show voltage instability.
It was found that the length of the feeder, point of connection and level of EV penetration played a crucial part
when it comes to voltage stability. All selected busbars from the second feeder remained within allowed limits
of voltage values while only one busbar from the beginning of the first feeder was above the limiting value.
This was mainly due to the length of the feeder, amount of loading and the distance from the source feeder.
For large fleets of EVs being connected and charged at the same time, voltage stability and power quality
becomes of crucial importance. Distribution system operators must pay attention to the impacts of charging on
power quality and stability of the distribution system, especially if vehicles are in close range, and situated
farther from source.
If no modifications are to be made to increase the network’s capacity, then only a limited number of vehiclescan
be allowed, with reference to the point of connection. A possible solution could be integration of small
distributed generators or implementation of renewables, dispersed along the network to decrease voltage
variations and increase power quality, especially near the end of the feeders, where critical nodes are.
Future work might include analyzing and modeling the impact of connection of photovoltaics or small wind
generators in terms of distributed generation, which are expected to improve the voltage levels and overall
variations.

�APPENDIX 1

�6.

References

[1] N. Rajaković, Analiza elektroenergetskih sistema II, Elektrotehnički fakultet, 2007
[2] E.W. Kimbark, Power system stability (Vol. 1). John Wiley &amp; Sons, 1995
[3] J.Y. Yong, V.K. Ramachandaramurthy, K.M. Tan, N. Mithulananthan, A Review on the State-Of-Art
Technologies of Electric Vehicle, Its Impacts and Prospects, Elsevier, Renewable and Sustainable Energy
Review 49 (365-385) 2015
[4] P. Kessel, H. Glavitsch, Estimating the Voltage Stability of A Power System. IEEE Transactions on Power
Delivery, 1(3), 346-354, 1986
[5] Electric vehicles are now the majority of cars sold in Norway, (20198, April 1st), Retrieved on 27th of May,
from: https://www.autoblog.com/2019/04/01/electric-vehicles-are-now-the-majority-of-cars-sold-in-norway/
[6] M. Zhang, J.H. Zheng, W.Z. Wang, M.T. Dai, Research on Static Voltage Stability Based on EV Charging
Station Load Modeling, the international conference on Advanced Power System automation and Protection,
2011 IEEE
[7] Y. Zhang, S. Xiaohui et al., Research of Voltage Stability Analysis Method in Distribution Power System
with Plug-In Electric Vehicle, PES Asia-Pacific Power and Energy Conference, 2016 IEEE
[8] E. Alghsoon, A. Harb, M. Hamdan, M., Power Quality and Stability Impacts of Vehicle To Grid (V2G)
Connection, The 8th International Renewable Energy Congress (IREC 2017)
[9] M. Ghaffarianfar, A. Hajizadeh, Voltage Stability of Low-Voltage Distribution Grid with High Penetration
of Photovoltaic Power Units, Energies 2018, 11(8), 1960, Available at: https://doi.org/10.3390/en11081960
[10] S. Avdaković, A. Bosović, Impact of Charging a Large Number of Electric Vehicles on the Power System
Voltage Stability, Elektrotehniki Vestnik,137-142, 2014.
[11] Y. Kongjeen, K. Bhumkittipich, Impact of Plug-in Electric Vehicles Integrated into Power Distribution
System Based on Voltage-Dependent Power Flow Analysis, Energies 2018.
[12] N. Nalo, A. Bosović, M. Musić, Impact of Electric Vehicle Charging on Voltage Profiles and Unbalance
on Low Voltage, Advanced Technologies, Systems, and Applications IV – Proceedings of the International
Symposium on Innovative and Interdisciplinary Applications of Advanced Technologies (IAT 2019), vol 83.
Springer
[13] C. Reis, F.M. Barbosa, A Comparison of Voltage Stability Indices. MELECON Mediterranean
Electrotechnical Conference (pp. 1007-1010). IEEE, May 2006
[14] C.A. Cazares, Voltage Stability Assessment: Concepts, Practices and Tools. IEEE/PES Power System
Stability Subcommittee Special Publication 2002 (SP101PSS).

�</text>
                  </elementText>
                </elementTextContainer>
              </element>
            </elementContainer>
          </elementSet>
        </elementSetContainer>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26321">
                <text>Impact of Electric Vehicles in a Grid-to-Vehicle Mode on Voltage Stability</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26322">
                <text>Naida Nalo, Emina Kišija, Mirza Šarić</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26323">
                <text>With a rapid development and a massive deployment of electric vehicles, the power system is&#13;
facing many challenges regarding power quality and voltage stability. This paper deals with the impact&#13;
of electric vehicle in grid-to-vehicle mode depending on different EV penetration levels and point of&#13;
connection on static voltage stability impact of a real low-voltage distribution network. Based on nine&#13;
variations created, results showed that connecting vehicles closer to the beginning of the feeders creates a&#13;
smaller voltage drop, therefore more vehicles can be connected. However, going farther from the feeder&#13;
causes voltage to go below 0.9 p.u. and eventually leads to instability.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26324">
                <text>electric vehicles, grid-to vehicle, static voltage stability</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="43">
            <name>Identifier</name>
            <description>An unambiguous reference to the resource within a given context</description>
            <elementTextContainer>
              <elementText elementTextId="26325">
                <text>2637-2835</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26326">
                <text>International Burch University, Sarajevo, Bosnia and Herzegovina</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26327">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26391">
                <text>January, 2020</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3447" public="1" featured="0">
    <fileContainer>
      <file fileId="4240">
        <src>https://omeka.ibu.edu.ba/files/original/3c43bbdca717ab9bb8b214e6318c4941.pdf</src>
        <authentication>a85ed28176bdfa2c9a8b13c68a6c6736</authentication>
        <elementSetContainer>
          <elementSet elementSetId="4">
            <name>PDF Text</name>
            <description/>
            <elementContainer>
              <element elementId="52">
                <name>Text</name>
                <description/>
                <elementTextContainer>
                  <elementText elementTextId="26319">
                    <text>Essentials of Digital Forensics
Start

Detected security
incident with
digital devices
used

Notification
LAW
Enforcement

PR

Digital forensic
action initiated in
written form

Security staff
notified

Preservation
Initial Incident
type identification

Collection

Consent form

Post-mortem
Live acquisition

Invoke incident
response team

Examination
Fraud
Malware

Analysis

Unauthorised
access
Network related
incident

Outcome
satisfied

DoS/DDoS
Domestic violence

NO

YES

Homicide

Managament

Reporting

Notification
End

Kemal Hajdarevic with
Nermin Ziga and Mirza Halilovic

�II

�Essentials of Digital Forensics

Kemal Hajdarevic with
Nermin Ziga and Mirza Halilovic

Sarajevo, 2019

III

�Authors:
Dr. Kemal Hajdarevic with Nermin Ziga and Mirza Halilovic
Proofreading: Ana Tankosic
Publisher:
International Burch University
Editor-in-Chief:
Dr. Kemal Hajdarević
Reviewed by: Dr. Hamid Jahankhani, Dr Jasmin Azemovic and Dr. Colin Pattinson

DTP &amp; Design:
Dr. Kemal Hajdarevic
DTP and Prepress:
International Burch University
Circulation: electronic copy
Place of Publication: Sarajevo
Copyright: International Burch University, 2019
Reproduction of this Publication for educational or other non-commercial purposes is
authorized without prior permission from the copyright holder. Reproduction for resale or
other commercial purposes prohibited without prior written permission of the copyright
holder.
Disclaimer: While every effort has been made to ensure the accuracy of the information,
contained in this publication, International Burch University will not assume liability for
writing and any use made of the proceedings, and the presentation of the participating
organizations concerning the legal status of any country, territory, or area, or of its
authorities, or concerning the delimitation of its frontiers or boundaries.
----------------------------------CIP - Katalogizacija u publikaciji
Nacionalna i univerzitetska biblioteka
Bosne i Hercegovine, Sarajevo
343.98:004
HAJDAREVIĆ, Kemal
Essentials of digital forensics [Elektronski izvor] / Kemal Hajdarevic, Nermin Ziga, Mirza Halilovic. - El. knjiga.
- Sarajevo : International Burch University, 2019
Način pristupa (URL): https://omeka.ibu.edu.ba/items/show/3447. - Nasl. sa nasl. ekrana. - Opis izvora dana
11. 7. 2019.
ISBN 978-9958-834-66-0
1. Žiga, Nermin 2. Halilović, Mirza
COBISS.BH-ID 27750406
-----------------------------------

IV

�Table of Contents

Author’s Preface ......................................................................................................... XI
IMPORTANT DEFINITIONS ......................................................................................XIII
PURPOSE OF THIS BOOK........................................................................................... XV
COMPUTER FORENSICS AND INFORMATION SECURITY TRAINING COURSES ........ XV
JOBS RELATED TO COMPUTER FORENSICS AND INFORMATION SECURITY ............ XVI
ORGANISATION OF THE BOOK SECTIONS ............................................................. XVII
LEARNING TRACKS ............................................................................................. XVIII
1.

Introduction to digital forensics ........................................................................ 1
CHAPTER ABSTRACT .................................................................................................. 1
HISTORY OF FORENSICS.............................................................................................. 1
HISTORY OF DIGITAL FORENSICS ............................................................................... 4
DIGITAL FORENSICS – DEFINITION ............................................................................. 5
DIGITAL EVIDENCE .................................................................................................... 5
DIGITAL VS. COMPUTER FORENSICS .......................................................................... 5
DIGITAL TRANSFORMATION IMPACT ON DIGITAL FORENSICS .................................. 6
AUDIT VS. DIGITAL FORENSIC INVESTIGATION ......................................................... 7
DIGITAL FORENSIC PROCESS ...................................................................................... 8
DIGITAL FORENSIC SCOPE .......................................................................................... 8
Personal computers and servers ............................................................................. 9
Network devices and active components .............................................................. 10
Databases ............................................................................................................. 10
Mobile Devices ..................................................................................................... 11
Digital Images ...................................................................................................... 11
Multimedia .......................................................................................................... 11
Memory ................................................................................................................ 11
FORENSIC INVESTIGATION INITIATION .................................................................... 12
INCIDENT RESPONSE ................................................................................................ 13
SUMMARY ................................................................................................................ 14
KNOWLEDGE ACQUIRED .......................................................................................... 14

V

�REVIEW QUESTIONS.................................................................................................. 14
FURTHER READINGS ................................................................................................. 15
VIDEO RESOURCES ................................................................................................... 15
2.

Digital forensics – classification ...................................................................... 17
CHAPTER ABSTRACT ................................................................................................ 17
DIGITAL FORENSIC CLASSIFICATION BASED ON DATA SOURCE .............................. 17
Forensics of general computer systems ................................................................ 18
Database forensics ................................................................................................ 19
Forensics of multimedia ....................................................................................... 23
Watermarking ...................................................................................................... 23
Digital signatures ................................................................................................ 23
Mobile device forensics ......................................................................................... 23
Network forensics................................................................................................. 24
SUMMARY ................................................................................................................ 25
KNOWLEDGE ACQUIRED .......................................................................................... 25
REVIEW QUESTIONS.................................................................................................. 25
FURTHER READINGS ................................................................................................. 25
VIDEO RESOURCES ................................................................................................... 26

3.

Digital forensics – process ................................................................................ 27
CHAPTER ABSTRACT ................................................................................................ 27
STEPS IN THE DIGITAL FORENSIC INVESTIGATION PROCESS .................................. 27
Preservation ......................................................................................................... 29
Collection ............................................................................................................. 31
Transport ............................................................................................................. 32
Examination ......................................................................................................... 32
Analysis ............................................................................................................... 33
TYPES OF DIGITAL EVIDENCE ANALYSIS................................................................. 33
Media analysis ..................................................................................................... 34
Media management analysis ................................................................................ 34
File system analysis ............................................................................................. 34
Network analysis.................................................................................................. 35
Application analysis ............................................................................................. 35
Operating System (OS) analysis ......................................................................... 36
Executable analysis .............................................................................................. 36
Image analysis ...................................................................................................... 36

VI

�Video analysis ...................................................................................................... 36
Memory Analysis ................................................................................................. 37
Reporting.............................................................................................................. 37
DIGITAL EVIDENCE COLLECTION ............................................................................ 38
Live Data collection.............................................................................................. 39
Post-mortem data collection ................................................................................. 41
DATA CONCEALMENT.............................................................................................. 42
Spoliation ............................................................................................................. 42
Encryption ........................................................................................................... 42
Steganography ..................................................................................................... 42
SUMMARY ................................................................................................................ 46
KNOWLEDGE ACQUIRED .......................................................................................... 46
REVIEW QUESTIONS.................................................................................................. 47
FURTHER READINGS ................................................................................................. 47
VIDEO RESOURCES ................................................................................................... 47
4.

Digital forensics – tools .................................................................................... 49
CHAPTER ABSTRACT ................................................................................................ 49
DIGITAL FORENSIC TOOLS ....................................................................................... 49
HARDWARE DIGITAL FORENSIC TOOLS AND THEIR USAGE ..................................... 50
Usage of hard disk docking stations ..................................................................... 50
Usage of memory card docking stations ............................................................... 51
Usage of Portable Computer Forensic Lab ........................................................... 51
USAGE OF GENERAL COMPUTER FORENSIC TOOLS................................................. 52
Disk Genius usage ............................................................................................... 52
DD command tool usage ...................................................................................... 53
Busybox usage ...................................................................................................... 54
Hash Calculation ................................................................................................. 54
DATABASE TOOLS USAGE ......................................................................................... 55
Usage of the Oracle LogMiner ............................................................................. 55
Usage of the IBM Guardium Data Protection for Databases .............................. 57
Usage of the DB Browser for SQlite .................................................................... 58
Usage of the Undark - a SQLite data recovery tool .............................................. 59
Usage of the SQLite-Deleted-Records-Parser ...................................................... 60
USAGE OF THE NETWORK FORENSIC TOOLS............................................................ 60
Wireshark usage ................................................................................................... 60

VII

�NIKSUN NetDetector usage ............................................................................... 62
Xplico usage ......................................................................................................... 62
USAGE OF THE MOBILE DEVICE FORENSIC TOOLS ................................................... 63
Rooting Tools usage ............................................................................................. 63
Santoku usage ...................................................................................................... 64
AF Logical OSE usage ......................................................................................... 67
Autopsy and the Sleuth Kit usage........................................................................ 67
Ingest Module usage ............................................................................................ 71
Android Analyser module usage .......................................................................... 72
Accessing Partitions ............................................................................................ 73
Timeline ............................................................................................................... 74
Reporting ............................................................................................................. 76
SUMMARY ................................................................................................................ 77
KNOWLEDGE ACQUIRED .......................................................................................... 78
REVIEW QUESTIONS.................................................................................................. 78
FURTHER READINGS ................................................................................................. 79
VIDEO RESOURCES ................................................................................................... 80
5.

Simulation of digital forensic cases................................................................. 81
CHAPTER ABSTRACT ................................................................................................ 81
CASE 1: FORENSIC DATA RECOVERY OF FILES ON PC.............................................. 81
CASE 2: FORENSIC INVESTIGATION OF VIBER, VOICE CALL, SMS, AND COCO ON
AN ANDROID MOBILE DEVICE .................................................................................. 84

DEFINING THE SCOPE OF THE INVESTIGATION ....................................................... 84
PREPARING THE ENVIRONMENT FOR THE DATA ACQUISITION ............................. 86
Rooting the Device ............................................................................................... 87
Busybox Sideloading ............................................................................................ 91
Determining Partitions and Blocks ..................................................................... 93
ACQUIRING DATA FROM THE EVIDENCE DEVICE ................................................... 95
Logical data acquisition........................................................................................ 95
Physical data acquisition...................................................................................... 98
IMPORTING IMAGE FILE INTO AUTOPSY ............................................................... 100
ANALYSIS OF THE ACQUIRED MOBILE DEVICE DATA .......................................... 100
Analysis of Logically Acquired Data ................................................................. 100
Analysis of the Physically Acquired Data ......................................................... 102
Viber Message and Call Investigation ............................................................... 104

VIII

�SMS Message Investigation .............................................................................. 109
GSM Voice Call Investigation ........................................................................... 112
Coco Message Investigation ............................................................................... 114
INVESTIGATION FINDINGS ..................................................................................... 117
ENDING INVESTIGATIONS ...................................................................................... 118
CASE 3: DATABASE FORENSICS – USER COMPLAINTS ON HIGH BILLS ................... 120
CASE 4: DATABASE FORENSICS – SALARIES DATA LEAKAGE ................................ 122
CASE 5: DATABASE FORENSICS – DATA DELETION ................................................ 125
SUMMARY .............................................................................................................. 128
KNOWLEDGE ACQUIRED ........................................................................................ 128
REVIEW QUESTIONS................................................................................................ 129
FURTHER READINGS ............................................................................................... 129
VIDEO RESOURCES ................................................................................................. 129
6.

Conclusions ...................................................................................................... 131
CHAPTER ABSTRACT .............................................................................................. 131

Appendix – Consent Form...................................................................................... 133
Appendix – Incident response form ...................................................................... 134
GENERAL DATA ABOUT INCIDENT......................................................................... 134
TYPE OF INCIDENT ................................................................................................. 134
Details for malicious software ............................................................................ 135
DoS / DDoS attack............................................................................................. 135
Details for an unauthorized access: .................................................................... 135
Leakage of data and information in public: ........................................................ 135
Appendix – Digital forensic process ..................................................................... 136
List of Figures ........................................................................................................... 138
List of Tables ............................................................................................................. 141
Acronyms .................................................................................................................. 143
References ................................................................................................................. 145
Index .......................................................................................................................... 159
About authors ........................................................................................................... 163

IX

�X

�Author’s Preface

Information

available

on

Internet

Live

Stats

web

site

(www.internetlivestats.com) that 40 percent of world’s population is
using Internet Media almost daily reports on different cyber and digital
security incidents. Many more similar incidents have never been reported
or they have been reported years after they had occurred due to the fact
that they could have jeopardised ongoing law enforcement investigations
or because they could have been embarrassing and thus negatively affect
reputation of the victim – organisation or a person.
After cyber- or information security incident, the obvious step is to make
efforts to minimize losses, establish practices to avoid future similar
situations, and punish executioners and/or masterminds of the incident to
discourage future attempts.
To be able to accomplish the above-mentioned goals it is necessary to
understand the nature of the incident, actual losses, and detect, collect, and
preserve evidence, as well as to detect and locate executives of attack that
led to the cyber incident.
A common scientific approach of collecting, preserving, analysing, and
reporting criminal cases where computers and digital equipment are used
XI

�or where they have been an object of the attack is called the digital
forensics. If a specific device or software is the object of the forensic
investigation process, the scientific approach can be called computer
forensics, network forensics, database forensics, etc.
There are different areas of digital forensics based on the object of the
criminal activity and on technological tools used to commit an attack.
Digital forensics can be performed by external forensic service or it can
be done in a house. Knowledge about forensic process is very important
even if the external forensic knowledge or service is used so that affected
organisation is able to monitor external forensic service or to perform
forensics internally if there are enough internal resources for such an
activity.
Some of the first professionals that can detect criminal or fraud activities
where computers are involved are computer operators and system or
network administrators. Another profession that can have an active role in
detecting fraud or abuse of the system resources are internal auditors.
Because internal and external auditors have experience, and a broad
knowledge of computer and network systems, they can detect criminal
activity and perform initial forensic analysis. However, forensics and
audit are not the same process, and differences between the two are
presented in this book.
Not every organisation is obliged to have a regular internal and external
audit, or testing for technical vulnerabilities (also called penetration
XII

�testing), nevertheless, from the experience of organisations which have
this type of assurance and from incidents which occurred in the past,
regular vulnerability checks are needed. Auditors can be given the task by
the top management to analyse a fraudulent or a criminal activity as
professionals who already have an in-depth knowledge of the specific
system. Furthermore, revealing the information about fraud or crime to
the public can bring a negative publicity.
That is why it is important for computer professionals, information
technology professionals, and internal auditors to understand steps and
procedure of the digital forensic investigation process. It is also important
for them to understand what a good digital forensic practice should be and
what should not be done during the forensic process.
The aim of this book is to clarify forensic topics and bring them closer to
students, professionals, information security managers, internal auditors,
and other IT specialists who want more information about digital forensic
process, tools, and activities. Based on Criminal Justice Degree Schools
(2019) as well as courses and authors’ experience in teaching, this book
also names potential and some already taught courses in computer
forensics and information security.

Important definitions
Data - “factual information (such as measurements or statistics) used as
a basis for reasoning, discussion, or calculation, (Data, MerriamWebster, 2019)

XIII

�Information – “a signal or character (as in a communication system or
computer) representing data; the communication or reception of
knowledge or intelligence, (Information, Merriam-Webster, 2019)
Information technology – “the technology involving the development,
maintenance, and use of computer systems, software, and networks for the
processing and distribution of data”. (Information technology, MerriamWebster, 2018).
Information system (IS) – “an integrated set of components for
collecting, storing, and processing data and for providing information,
knowledge, and digital products… The main components of information
systems are computer hardware and software, telecommunications,
databases and data warehouses, human resources, and procedures…”,
(Information system, Britanica, 2019)
Information System (IS) Security – “Refers to the activities, processes,
methodologies, frameworks, and standards used for the maintenance of
information and information assets confidentiality, integrity, and
availability”. (Techopedia, 2018)
Forensics – “belonging to, used in, or suitable to courts of judicature or
to public discussion and debate” (Forensic, Merriam-Webster, 2018).
Digital forensics - includes not only computers but also any digital device,
such as digital cameras, flash drives, digital networks, cell phones, IoT.
Wiley C. (2019)
XIV

�Internal auditing - “Internal auditing is an independent, objective
assurance and consulting activity designed to add value and improve an
organization's operations. It helps an organization accomplish its
objectives by bringing a systematic, disciplined approach to evaluate and
improve the effectiveness of risk management, control, and governance
processes.” (IIA, 2019)

Purpose of this book
The purpose of this book is to provide an insight into forensics of
computer and other digital devices. This is because the world of physical
operations and business is changing into digital and the world of Internet
wherever possible, thus creating a greater risk of cyber-attacks. In
common business surroundings, criminal activities are not something that
business owners would like to encounter. Considering that digital world
and cyber-attacks are not something that business owners usually come in
contact with, they are more often not prepared for the aftermath of the
potential incident. They are also unaware of their need for the computer
or digital forensics investigation process. Thus, the purpose of this book
is to familiarize them with the following: Confidentiality, Integrity,
Availability (CIA), Authentication, Authentication, and Audits.

Computer Forensics and information Security Training
Courses
Following are the courses in the field of information security and cyber
forensics:
-

Computer Forensics Essentials

-

Cybercrime
XV

�-

Current Issues in Cyberlaw

-

Computer Forensics File Systems

-

Architecture of Secure Operating Systems

-

Forensic Analysis in a Windows Environment

-

Forensic Analysis in a Linux/Unix Environment

-

Malware and Software Vulnerability Analysis

-

Network Security

-

Network Forensics

-

Mobile Forensics Analysis

-

Forensic Management of Digital Evidence

-

Cyber Incident Analysis and Response

-

Digital Forensics Investigative Techniques

-

Forensic Management of Digital Evidence

-

Computer Forensic Ethics

-

Advanced Topics in Computer Forensics

-

Information Systems Security Planning and Audit

Criminal Justice Degree Schools (2019)

Jobs related to computer forensics and information
security
Based on Criminal Justice Degree Schools (2019) and authors’ experience
following are some job titles common in the cyber security industry:
-

Business Intelligence Analyst

-

Information Security Auditor

-

Information System Auditor

-

Crime Analyst

XVI

�-

Computer Forensics Investigator

-

Computer Systems Analyst

-

Cybersecurity Officer

-

Digital Forensics Investigator

-

Digital Forensics Specialist

-

Information Security Officer

-

Chief Information Security Officer

-

Information Security Analyst

Organisation of the book sections
This book is divided into six sections:
1. Introduction to digital forensics
2. Digital forensics – classification
3. Digital forensics – process
4. Digital forensics – tools
5. Simulation of digital forensic cases
6. Conclusions
While reading, it is possible to follow different tracks.

XVII

�Learning tracks
It is possible for a reader to acquire a specific set of skills and knowledge
on certain paths through different chapters.
Chapter

Introduction

Digital
forensics
classification

Digital
forensics
process

X

X

X

X

X

X

X

X

X

X
X

X
X

X
X

X
X

X
X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

Job title
Business
Intelligence
Analyst
Information
Security
Auditor
Information
System
Auditor
Crime Analyst
Computer
Forensics
Investigator
Computer
Systems
Analyst
Cybersecurity
Officer
Digital
Forensics
Investigator
Digital
Forensics
Specialist
Information
Security
Officer
Chief
Information
Security
Officer
Information
Security
Analyst

XVIII

Digital
forensics
tools

Digital
forensics
cases

X

X

X

�XIX

��1. Introduction to digital forensics

Chapter abstract
Chapter goals: Digital transformation has a great impact on cyber
forensics because of new services in place, new technologies, and devices.
This chapter presents some general information about the early
advancements in forensics, and digital forensics. It also provides the
explanation of what the digital evidence is and in what state it can be
found. Furthermore, this chapter explains different types of digital
forensics as well as the difference between digital forensic analysis types.
Digital forensics is usually followed by and triggers incident response
process which is also explained in this chapter.
Learning outcomes: Learning about one aspect of the forensic history.
Knowledge of the core principles of forensics and digital forensics.

History of forensics
In early societies there was a need to resolve different issues and disputes
in an acceptable manner so that conclusions are clear and there is no space
for ambiguities. As presented in Figure 1. the English word forensic
comes from the Latin word forum and it initially meant “in open court”
(Williams A., 2000).

�Figure 1. Word “Forensic” explanation (google, 2018)

Historians found evidence of the ancient societies’ need for clarification
of criminal and other cases in process of finding the truth for the events
that happened before, using the science of that time and common
knowledge for a better understanding of past events (Williams A., 2000).
It was a practice to present evidence to the public for comments and
criticism with a goal to make everyone aware of what happened in a
specific case. With time, forensic process became a key part of all criminal
investigation cases which came later.
Forensic process became a key step of every future criminal investigation
case, because every criminal case needed a resolution in terms of finding
who is responsible for the wrongdoings.
Edmond Locard Principle of Exchange (Crime Museum, 2019):
“..when a person commits a crime something is always left at
the scene of the crime that was not present when the person
arrived.”

2

�The “something” is the goal of every forensic investigator, and it is crucial
to detect and preserve it for the later use in the process of reporting
findings.
German born scientist Archibald Reiss was the founder of the first
academic forensic science program and Institute of forensic science at the
University of Lausanne in 1909. (Witte de With, 2019).
Through history, forensics as a discipline is perhaps mostly known from
the medical pathology cases, however, recent history shows that traffic
accident cases, usage of firearms, and digital and computer equipment
also became an important area of forensic investigations.
One view on history of forensics would certainly include usage of
fingerprints found at the crime scene. Because of its uniqueness, the
fingerprint became an important resource which is used to authenticate
each person. As some other scientific advancements, the fingerprint used
for the forensic purposes contributed more than a single inventor (History
of Fingerprints, 2018). Recent advancements in computer technology use
pictures and videos to identify a person with a high accuracy (Kremic,
Subasi, Hajdarevic, 2012).
Other important methods used for forensic purposes were blood
groupings, and DNA sampling, firearms and bullet comparison, traffic
analysis, and other (History of Fingerprints, 2018) as listed below:
-

Francis Galton, Edmond Locard – study of fingerprints

-

Leone Lattes – Discovered blood groupings (A, B, AB, &amp; 0)
3

�-

Calvin Goddard – Firearms and bullet comparison

-

Albert Osborn – Developed principles of document examination

Due to different areas where scientific forensics can help in solving
disputes, different forensic research areas emerged, some of which are
named below:
-

Forensic Pathology – Sudden unnatural or violent deaths

-

Forensic Anthropology – Identification of human skeletal remains

-

Forensic Psychiatry – Forensics of psychiatric cases

-

Forensic Odontology – Dental forensics

History of digital forensics
Computers are objects of early forensic investigations, and digital
forensics is related to all digital equipment, not only computer devices.
Today many digital devices that use, store, and communicate digital data
are available. All these digital devices are potential candidates for forensic
investigation cases.
Below is a short history of digital forensic advancements:


1984 FBI Computer Analysis and Response Team (CART) was
formed.



1991 International Law Enforcement meeting was held to discuss
computer forensics and the need for the standardized approach.



1997 Scientific Working Group on Digital Evidence (SWGDE)
was established to develop standards.



2001 Digital Forensic Research Workshop (DFRWS) was
established for development of the research roadmap.

4

�Digital forensics – definition
Digital forensic investigators use science throughout the entire process of
collecting, analysing, and reporting evidence.
Digital Forensic Science (DFS) is defined by Digital Forensic Research
Workshop (DFRWS, 2001) as:
“The use of scientifically derived and proven methods toward
the preservation, collection, validation, identification,
analysis, interpretation, documentation and presentation of
digital evidence derived from digital sources for the purpose
of facilitating or furthering the reconstruction of events found
to be criminal, or helping to anticipate unauthorized actions
shown to be disruptive to planned operations.”

Digital evidence
Heart of every digital forensic investigation is data as evidence upon
which the entire potential case is built. When considering types of digital
forensics, one approach could be to classify digital forensic analysis based
on data sources for digital investigation, because data is crucial for making
decisions, navigating through evidence, and producing the digital
forensics report.

Digital vs. Computer forensics

5

�Digital evidence is the heart of every digital forensic investigation and
sometimes the term computer forensics is used to refer to the same
process. Computer forensics is related to the forensics of computers and
related devices, as well as associated software used on computers. On the
other hand, digital forensics has a wider scope which includes digital
devices such as smart and cell phones, flash drives, media devices, and
digital cameras. The purpose of digital forensics is to determine whether
a device is used in a criminal act. Criminal act can be the computer fraud,
computer hacking, traffic accidents, illegal pornography distribution, etc.
Wiley C. (2019)

Digital forensics

Computer forenisc

Figure 2. Digital and Computer forensic realm

Digital transformation impact on digital forensics
Digital transformation has an impact on digital forensics because of an
increased number of users and digital devices.

6

�These devices are used and sometimes misused in a way that they become
objects of criminal investigations. Law enforcement agencies use sources
such as personal or business computers and Internet cache history to
analyse behaviour of suspects and law offenders with a goal to resolve
criminal cases.

Audit vs. Digital forensic investigation
Having digital devices as means of support for business and everyday life
activities poses a risk of using those devices for unlawful or other
wrongdoings. To support every activity where digital data exists, there is
a need to analyse and investigate how data and digital devices are being
used. Two general approaches for analysing and investigating digital
evidence and operation with digital data are known as audit and digital
forensic investigation.
Audit and forensic investigation are not the same and based on Marcella
and Mendey’s (2008) comparison, this book presents some major
differences between the two investigation processes.
TABLE 1. Audit vs. Digital forensic investigation
Elements
Definition

Audit
“Internal auditing is an
independent, objective assurance
and consulting activity designed to
add value and improve an
organization's operations. It helps
an organization accomplish its
objectives by bringing a systematic,
disciplined approach to evaluate
and improve the effectiveness of
risk management, control, and
governance processes.” (IIA, 2019)

Cyber Forensic Investigation
“The use of scientifically derived
and proven methods toward the
preservation, collection,
validation, identification,
analysis, interpretation,
documentation and presentation
of digital evidence derived from
digital sources for the purpose of
facilitating or furthering the
reconstruction of events found
to be criminal…”(DFRWS, 2001)

7

�Objective

To determine alignment of
organisational operation with law
regulations, bylaws, and standards.
The scope should be determined
during the planning phase and it
depends on the audit goals.
Planned regular audits or audit by
the request of management.

To detect digital evidence and
identify individuals responsible
for the wrongdoing.
All digital devices which can be
used to document a specific
case.
Part of the investigation process
after an incident in which digital
device was used.

Methodology

Professional Practice of Internal
Auditing by The Institute of Internal
Auditors.

Reporting

Reporting to the organisation or
company management.

Impact

Presented in a non-confronted
manner, the aim is to help auditee
recognise risks and improve
performances and level of
alignment with law and standards.

Available and approved local or
international methodology
which defines digital forensic
steps such as justification for
starting the forensic
investigation, getting approval
for investigation, and steps for
conducting forensic
investigation on the scene:
“…preservation, collection,
validation, identification,
analysis, interpretation,
documentation and presentation
of digital evidence derived from
digital sources…“ (DFRWS, 2001)
Reporting to prosecutor, law
enforcement, or the
organisational management.
It depends on the investigation
outcome.

Scope

Timing

Digital forensic process
Digital forensic process refers to the identification, preservation,
collection, analysis, and reporting of evidence found on any digital device
to support investigations and legal actions.

Digital forensic scope
Scope of digital forensics is not limited to specific technology, hardware,
or software component, because digital evidence can be stored in a
8

�database or file, and transferred via different network technologies.
Criteria for determining scope of digital forensic investigation can be
based on the object of attack or fraud, devices used for fraud or attack, and
vector of the attack.
Some of these sub-disciplines of digital forensics which determine digital
forensics are presented below (Open University, 2018).
Personal computers and servers
Computer forensic process is performed on computers, laptops, and
storage media.

PC
PC
Tap
PC

Switch

PC

Monitoring
device

Computer Forensic

Figure 3. Computer forensic

Forensic investigators search for digital evidence in directories, files, and
logs that can be stored on hard drives, and other media such as removable
media used with computer systems.

9

�Network devices and active components
Network forensic process includes monitoring and/or capturing,
preserving and analysing network traffic, sessions, and other network
activities or events in order to discover the source of security attacks,
intrusions, or other problem incidents, i.e. worms, virus, or malware
attacks, abnormal network traffic, and security breaches.
Special care must be taken in collecting forensic data in networks because
network traffic has to be captured in order to be analysed. In most cases if
the traffic and session are not captured, it is only possible to analyse result
of sessions and traffic generated in time before the investigation took
place.

PC
PC
Tap
PC

Switch

PC

Monitoring
device

Network Forecisc

Figure 4. Network forensics

Databases
The recovery of information from databases entails the recovery of logs
associated with database operations, as well as user and administrator
interactions with data stored in database files and logs.
10

�Mobile Devices
Mobile device forensics is the process of collecting and analysing
electronic evidence from mobile phones, smartphones, SIM cards, PDAs,
GPS devices, tablets, and game consoles.
Digital Images
Digital image forensics is the process of the extraction and analysis of
digitally acquired photographic images to validate their authenticity by
recovering the metadata of the image file to ascertain its history.
Multimedia
Multimedia forensics encompasses Digital Video/Image/Audio Forensics
which refers to the collection, analysis, and evaluation of sound, image,
and video recordings. The science in this sense refers to the establishment
of authenticity as to whether a recording is original and whether it has
been tampered with, either maliciously or accidentally.
Memory
Live acquisition or memory forensic process refers to the recovery of
evidence from the RAM of a running computer.
Triggers for digital forensics
Different events trigger digital forensic investigation such as:


Denial of service attacks



Child pornography



Domestic violence



Using organisation’s computer or other equipment for the
personal benefit



Computer fraud
11

�

Hacking



Blackmail



Extortion



Homicide cases



Missing person



Other cases

Events stated above trigger incident response which has to involve digital
forensic process.

Forensic investigation initiation
Common practice for the forensic analysis is that law enforcement
initiates the forensic analysis in a written form.

Who

What

Digital forensic
analysis goals to
detect

Where

When

Figure 4. Forensic analysis goals to detect – who, what, when, where

Other possibilities for the initiation of the digital forensics could be
company’s or organisation’s management with a goal of performing the
12

�forensic analysis to determine who, what, when, and where is something
done with the use of digital equipment (digital assets).

Incident response
Computer and digital forensics has to be a part of the incident response
due to the fact that after each incident, proper actions need to be taken so
that the future incidents are prevented, and perpetrators are punished.

Preparation

Identification

Containment

Eradiction

Recovery

Post-Mortem

Figure 5. Incident response plan (Banking and Insurance, 2017)
13

�Incident response is performed through predefined stages and it is usually
a planned activity (Banking and Insurance, 2017). It contains stages as it
is shown in Figure 5: Preparation, Identification, Containment,
Eradication, Recovery, Post-Mortem. Some useful information about the
recovery phase and post mortem-analysis can be found in the Appendix –
Incident response form.
Post-mortem is considered to be the initial step of the digital forensic
process which is explained in Chapter 3.

Summary
Digital forensics is a science about investigation where digital equipment
is used to acquire relevant data for criminal investigations.
On the market, we encounter new devices, software, and services which
could be the object or tool for committing a cyber-crime, which in order
to be solved requires a specific knowledge to conduct a criminal
investigation.

Knowledge acquired
The difference between different digital forensic types. History of
forensics and digital forensics.

Review questions
1. Explain the difference between computer and digital forensics.
2. Define digital forensics.
3. What are the types of digital forensics?
4. What is the incident response and what triggers it?
14

�5. Why is digital and computer forensics important?
6. What is digital evidence?
7. What are the basic steps of digital forensics?

Further readings
-

US CERT Cyber forensic,
https://www.uscert.gov/sites/default/files/publications/forensic.pdf

-

A Beginners Guide to Computer Forensic
http://ithare.com/a-beginners-guide-to-computer-forensic/

Video resources
-

How the Feds Caught Russian Mega-Carder Roman Seleznev
https://www.youtube.com/watch?v=6Chp12sEnWk&amp;t=2529s

-

Cyber forensic
https://www.youtube.com/watch?v=2D5wTo1adbg

-

What is cyber forensic
https://www.youtube.com/watch?v=lxUN-fOIe00

-

What is cyber forensic, Smithsonian Channel
https://www.youtube.com/watch?v=BSyi6yMIB0s

15

�16

�2. Digital forensics – classification

Chapter abstract
Chapter goals: To present different computer and digital forensic types
based on data source used for the digital forensic investigation. To
explain each recognised class of forensic investigation.
Learning outcomes: Knowledge of the core forensic classification and
data such as database log files important for conducting the forensic
investigation.

Digital forensic classification based on data source
Based on data source and scope of digital forensic explained in the
previous chapter, digital forensics can be classified as following: general
computer system forensics, database forensics, forensics of multimedia
devices, forensics of general computer systems, mobile device forensics,
and network forensics.
17

�Figure 6. Digital and Cyber forensic types

Forensics of general computer systems
Computer systems are built on components such as motherboards,
memory, hard drives, monitors, and DVD. Depending on usage and
mobility, systems can be on laptop, home computer, work computer, and
server in the enterprise environment. These systems can have an
abundance of interesting digitally stored information needed for a
potential forensic analysis. Investigators can obtain written documents
with dates of creation, e-mail correspondence, pictures, messages, etc.
This information can be used to determine the timeline of events and
involved actors. (Casey, 2011).
18

�Database forensics
Database forensics relies on data stored in databases and files used by
database management system (DBMS).
Paul M. Wright (2007) defined major sources of evidence in Oracle
database which can be considered when performing database forensics:
Listener log – This log stores the name of the listener, protocol, and
communication port used for accepting connections, nodes allowed to
connect to database, database services, and control parameters.
Alert log – This log stores starting and halting database, errors connected
to data storage, etc.
Sqlnet log – The purpose of this log is to keep track of an unsuccessful
access to a database. Forensic analyst has to check this log to discover
potential unauthorised attempts to access database. This log can provide
useful information about the source address of the connection
establishment attempt.
Redo logs – This log holds history of all changes in a database. Every
redo log file has a redo record that represents the change made in a specific
block in database (Oracle, pp. 79) if Oracle archiving is activated
(Litchfield, 2007). Every change in a database is written on database
buffers in the system global area (SGA) memory. Buffers are stored either
by issuing COMMIT command, or they are stored every three seconds on
a disk in the file known as Online Redo Log by Oracle Log Writer
19

�background process (LGWR). There is a possibility that these logs can be
filed up and log files rewritten with new entries. To be able to recover
important logs from database and avoid deletion of important logs it is
necessary to activate Archive (ARCn) option in a database (Litchfield,
2007).
It is possible to check if archiving is turned on by issuing SQL query:
SQL&gt; SELECT VALUE FROM V$PARAMETER WHERE NAME =
‘log_archive_start’;
VALUE
-------TRUE
Value TRUE indicates that log archiving is activated, while FALSE
indicates that it is not enabled.
FGA (Fine Grained Auditing) audit log can be used for collecting data
about changes in a database. It tracks commands INSERT, UPDATE, and
DELETE, and other changes such as data movement in a database. All
detected activities are recorded in audit tables (Oracle Fine Grained
Auditing, 2019).
Nanda A. and Burleson (2003) wrote:
“The ability to check who actually handles objects, not just who has
authority is provided by auditing. A good auditing system provides a
20

�process for recording the access to the objects in a storage system,
forming an audit trail”
(Oracle DBA_FGA_AUDIT_TRAIL, 2019):
“Audit trail records created by Fine Grained Auditing can be captured
and analysed in Oracle Audit Vault and Database Firewall, automatically
alerting the security team about possible malicious activity.”
Audit tables contain information presented below (Oracle Fine Grained
Auditing, 2019):
DB_USER – database user which issued queries in database.
SESSION_ID – unique ID session.
TRANSACTION_ID – Transaction ID with which object is changed or
accessed.
OS_USER – Operating system user.
USERHOST – name of the computer (host).
OBJECT_SCHEMA &amp; OBJECT_NAME – scheme and table.
SCN – (System Control Number of the database) – defines when an audit
trail was generated.
SQL_TEXT – text SQL commands.
COMMENT$TEXT – additional comments linked to audit if they exist.
EXT_NAME – If users are accessing from the outside, their name is
displayed here.
TIMESTAMP – date and time of the audit.
The following are DBA_AUDIT tables that can be used for the forensic
analysis, and which can be listed by issuing SQL query:
21

�SELECT view_name
FROM dba_views
WHERE view_name LIKE 'DBA%AUDIT%' OR view_name LIKE
'USER%AUDIT%'
ORDER by view_name

DBA_AUDIT_EXISTS
DBA_REPAUDIT_ATTRIBUTE
DBA_REPAUDIT_COLUMN
DBA_AUDIT_OBJECT
DBA_AUDIT_SESSION
DBA_STMT_AUDIT_OPTS
DBA_AUDIT_STATEME
DBA_AUDIT_POLICIES
DBA_AUDIT_TRAIL
DBA_AUDIT_POLICY_COLUMNS
DBA_COMMON_AUDIT_TRAIL
DBA_FGA_AUDIT_TRAIL
DBA_OBJ_AUDIT_OPTS DBA_PRIV_AUDIT_OPTS
USER_AUDIT_SESSION
USER_AUDIT_OBJECT
USER_AUDIT_STATEMENT
USER_AUDIT_TRAIL
USER_AUDIT_POLICIES
USER_AUDIT_POLICY_COLUMNS
USER_OBJ_AUDIT_OPTS
USER_REPAUDIT_ATTRIBUTE
USER_REPAUDIT_COLUMN

Tables above contain data that indicate which, what, where, and when
specific user made changes. This information can be used for the forensic
analysis of Oracle database.
Forensic tools presented in Chapter 4. Digital forensics tools are used for
database forensic investigation to find specific evidence in a large volume
of data through different files and tables in a database.
22

�Forensics of multimedia
Multimedia such as audio, video, and pictures are sources of digital data
which can be used for the forensic analysis.
Most popular devices that hold multimedia content are smart phones,
however, other devices such as gaming consoles, TVs, PDAs, CCTV,
other video or audio recording, and even IoT devices are also multimedia
devices which can be used for the forensic analysis.
Watermarking
Watermarking of image is a process of identification of user who created
it as well as the original source of that image.
Digital signatures
Digital signatures are signatures which can be found in an electronic form,
and which indicate a specific originator of electronic data.
Mobile device forensics
Increased usage of mobile devices opens digital forensic area of mobile
devices.
Computer systems are not only in a form of desktops, laptops, or servers.
They are also produced in a form of small computers embedded into smart
cards, mobile devices, GPS devices, and car computers. Mobile
communication devices can contain personal information, messages,
photos, and locations. Navigations systems can reveal location
information of a person under the investigation. All those devices are
valuable sources of information, especially because embedded devices are
23

�usually small, and used on a daily basis and in the mobile environment
(Casey, 2011).
Network forensics
Modern life is embedded into communication systems by all means.
Humans,

computers,

and

sensors

all

communicate

through

communication networks. Pieces of information are always left in the
system logs, no matter what type of communication is used. Traditional
telephone systems and internet service providers can be valuable points
for the investigation of the digital evidence. Mobile service providers
transfer SMS/MMS messages and mobile internet interconnections, while
Internet service providers transfer e-mails. In addition to the exact content
of the communication channel, an additional log examination can give
more information about who, when, and to whom information is sent
(Casey, 2011).
Network forensics is performed in order to investigate network flows,
network traffic and network connections. To be able to collect and analyse
network traffic, traffic has to be recorded and archived for the later use.
In most organisations, this approach is not applied because it adds an
additional load on the already busy network administrators. Many
network devices such as switches, routers, and firewall have basic syslog
capabilities which provide network administrators with information about
established connections, and device operations. Syslog functionality
cannot provide information about data payload inside network packets.

24

�Summary
Cyber security is a subset of information security that deals with the
security of information stored in a digital form and transferred over
communication links. A great part of information security related
standards deals with cyber security issues.
Almost on a daily basis, media reports reveal cyber security related
incidents. After the historical analysis, we can conclude that we will see
an increase in incidents of this type, especially as more services and users
use digital technology in everyday work and life.

Knowledge acquired
The difference between digital forensics classification types that includes
Forensics of general computer systems, Database forensics, Forensics of
multimedia, Watermarking, Digital signatures, Mobile device forensics,
Network forensics.

Review questions
1. What is watermarking?
2. Name digital and cyber forensic types.
3. What is network forensics?
4. What is mobile device forensics?

Further readings
-

Network forensics
https://www.itpro.co.uk/cyber-attacks/31660/what-is-networkforensic

25

�Video resources
-

Advanced Wireshark Network Forensics – Part 1/3
https://www.youtube.com/watch?v=e_dsGhvq9CU

-

Network Forensic Data Theft Detection, Under the Hood
https://www.youtube.com/watch?v=CYRYmKhz3QI

-

Mobile Device Forensics
https://www.nist.gov/sites/default/files/documents/2017/05/08/aa
fs-mobiledeviceforensic.pdf

-

Forensics, SANS
https://www.sans.org/readingroom/whitepapers/forensic/paper/32888

26

�3. Digital forensics – process

Chapter abstract
Chapter goals: To define digital forensic process which includes
Preservation, Handling evidence at crime scene, Collection, Transport,
Examination, and Analysis of digital evidence. This chapter briefly
explains media analysis, file system analysis, network analysis,
application analysis, OS analysis, executables analysis, image analysis
video analysis, memory analysis, and reporting. It also provides the
explanation regarding digital evidence collection and data concealment.
Learning outcomes: Knowledge of core principles of digital forensics, and
different types of analysis.

Steps in the Digital Forensic Investigation Process
In order to successfully show evidence and defend legitimacy of the entire
forensic process, it is necessary to perform every step of forensic
investigation with sound science methods. Courts will not accept evidence
if forensic process was jeopardised with negligence in evidence handling,
27

�preservation, and transportation. Forensic investigators and examiners
must be well trained and certified for forensic investigations. All actions
in the forensic investigation process have to be well documented through
policies and procedures. Every digital forensic investigator or agency has
to follow digital forensic steps, so that reports are admissible at the courts
of law.

Preservation

Collection

Examination

Analysis

Reporting

Figure 7. Steps in the Digital Forensic Investigation Process
28

�One of the main approaches in forensic investigation is to follow welldefined and accepted digital forensic investigation steps (Kaur and Kaur,
2012):
-

Preservation

-

Collection

-

Examination

-

Analysis

-

Reporting.

In Appendix – Digital forensic process are presented steps for forensic
process.
Preservation
In the preservation phase, all evidence has to be properly documented to
avoid any prior change of the crime scene. Crime scene has to be secured
so that nothing is changed when investigators enter the scene.
Digital forensic investigators are focused on finding and preserving digital
evidence, however, it is also possible that other forensic skills are needed
to collect biological samples such as fingerprints, DNA, etc. All
mentioned evidence has to be detected, documented, and preserved in the
original form, if possible, to avoid jeopardizing data and evidence
integrity. Depending on available information it is possible that digital
devices are contaminated with hazardous material. In that case other
forensic investigation specialists might be needed.

29

�If a device such as PC or a mobile device is found switched off, and
somebody turns it on as a part of digital forensic process, they may cause
a change of potential evidence on that device, in which case evidence
would lose its integrity and it would not be valid (Kaur and Kaur, 2012).
Massachusetts Digital Evidence Consortium (2015) explained in their
publication that first responders have to perform evidence preservation
and collection with a special care. Crime scene has to be investigated with
forensic methods only if law enforcement agencies approve such process.
All digital evidence such as hard disks has to be secured from the high
temperature, high electromagnetic fields, and moisture. This is because
such external influence can destroy potential evidence.
Forensic investigators are responsible for documenting the crime scene by
taking photographs and making video recordings of the scene. It is useful
to sketch the scene and keep records about investigators who were on the
scene as well as their responsibilities. It is also suggested to ask owners of
devices if they are willing to cooperate, and if they give their consent
investigators can request passwords, PIN, or other security features.
Device owner has to sign consent form with authentication methods and
passwords. Owner has to provide information of other possible
authentication methods such as face, fingerprint, or other biometric
recognition methods used for the authentication.
At the end of this book the Appendix – Consent form is an example of the
consent form created based on Massachusetts Digital Evidence

30

�Consortium (2015) documentation. If the consent is not given, suspects in
many jurisdictions will be fined.
The chain of custody has to be kept through the entire process. Digital
evidence must be secured at all times, so that all activities performed
during seizure, access, storage, and transfer can be completely
documented, preserved, and authorized. Documentation which proves all
of the above has to be available for the review. It needs to be emphasized
that individuals are fully responsible for digital evidence while evidence
is in their custody.
It is important to determine if devices are switched on or off.
If a device is switched on and then switched off, data about active
connections or data from volatile memory would be lost. This is a way in
which forensic investigators have to check if the device produces
vibrations due to HDD operation, other sounds, and lights. Device has to
be accessed with caution, by isolating it from networks such as wired,
wireless, and GSM. If possible, device has to stay powered to collect all
available passwords.
If a device is turned off and then switched on, potential evidence would
be lost. Thus, the device has to be packed and prepared for the
transportation.
Collection
Collection is the process of detecting and collecting evidence relevant for
the forensic investigation. Because most of data is stored on media such
31

�as hard disk, memory cards, and other removable media, it has to be
duplicated: cloned and/or copied to media that will be used in the forensic
investigation process. Forensic investigators should not change collected
evidence, because in that way the investigation process would be
compromised. Sources such as seized hard disc have to be secured and
kept in custody while investigation is performed with cloned data (Kaur
and Kaur, 2012).
Transport
There is a risk associated with a transport of digital evidence because its
confidentiality, integrity, and availability can be jeopardized. Therefore,
it is important that digital forensic investigators be well educated and
aware of the risk associated with digital evidence transportation. Digital
evidence has to be delivered to forensic laboratory in the shortest time
period, and protected from external influences depending on inherited
weakness of specific digital device or asset (Law Enforcement Cyber
Center, 2017).
Examination
Process that defines which methods and tools have to be used in the digital
forensic process is called the examination. Different devices which hold
digital evidence may require different tools and methods for acquiring
forensic evidence. All activities in the examination process have to be
performed on cloned and copied data (Kaur and Kaur, 2012).

32

�Analysis
Analysis refers to the process of using examined data and placing findings
from the examination stage in the context for the digital forensic report.
In the analysis process, available data is used to determine meaning of that
data, i.e. how it was created or transferred to or from a device, and what
story data tells forensic investigators. In the analysis process, forensic
investigator has to acquire information about data ownership, potential
hidden data, file, or application.

Types of Digital Evidence Analysis
Due to a different source and scope of data usage, digital forensic
investigators are able to conduct different types of digital forensic
investigation (Carrier and Spafford, 2004).
Examples of digital forensic analysis reported by Carrier and Spafford
(2004) are the following:
 “Media analysis


Media management analysis



File system analysis



Network analysis



Application analysis
o OS analysis
o Executable analysis
o Image analysis
o Video analysis



Memory analysis”

33

�These types of analysis can be applied to computer as well as mobile
devices.
Media analysis
Media analysis refers to the analysis of storage media. It does not consider
any partitions or other operating system-specific structures. Storage media
can be USB drive or disk, and SD cards for cameras or mobile devices
(Carrier and Spafford, 2004).
Media management analysis
Media management analysis focuses on media logical organization, such
as combining more disks into one logical volume. An example of
combining more disks into a logical volume is mirroring of two physical
disks into one logical disk. Mirroring disks in such manner means that one
chunk of information is written on both disks at the same time. In case of
one disk failure, another one continues to operate (Carrier and Spafford,
2004).
File system analysis
File system analysis is the analysis of the system data inside the disk or
deleted files in order to extract the contents of the file (Carrier and
Spafford, 2004). File system takes care of the files written across the
available partition. In case a file is deleted, it is usually marked deleted,
signalling to other processes that location is free to record the next data.
When deleted files need to be recovered, special tools can be used to locate
file fragments and rebuild them to a useful file.

34

�Network analysis
Network analysis refers to the analysis of the data inside protocol layers
(Carrier and Spafford, 2004). Network analysers can be used to
reconstruct raw data packets into application layer information.
Communication level is essential to reconstruct possible scenarios of user
or computer interactions, and it is a very valuable source of information.
Application analysis
This type of analysis analyses data information inside the files and
application. Files are created by the user, and format of the content is
application-specific such as text documents or photos.

Figure 8. Application analysis

35

�Some special types of a common application analysis are:
o OS analysis
o Executable analysis
o Image analysis
o Video analysis
Operating System (OS) analysis
OS analysis is the operating system-specific analysis of the configuration
and events during usage (Carrier and Spafford, 2004). OS communicates
with hardware and upper layers. All interaction details such as errors,
warnings, different types of events as well as configuration, are recorded
and stored inside OS compartments. This information can help build the
overall digital landscape.
Executable analysis
Executable files can cause events and they are noticed when executed as
processes. Executables such as malwares are common for the analysis
during the intrusion investigation (Carrier and Spafford, 2004).
Image analysis
Image analysis refers to the analysis of the person recorded on image,
location, or timestamp. Image analysis includes the analysis of the
potential steganography information (Carrier and Spafford, 2004).
Video analysis
Video files are the subject of the analysis of surveillance cameras, web
camera, and smart phone camera. Same as image analysis, video analysis

36

�leads to information about person, location, or timestamp (Carrier and
Spafford, 2004).
Memory Analysis
Memory analysis can reveal very useful information, because it is used
for dynamic operations and storage of temporary results.
Operating systems use two types of memory:
a) The volatile memory (RAM) is a fast memory used for dynamic
operations. It stores data until device is switched off. The main
function of volatile memory is to store application and system data
during runtime, which contain information such as password,
usernames, session data, encryption keys, data about activities and
network, etc.
b) The non-volatile memory refers to the internal storage such as
flash memory and equipment extensible storage device known as
the SD card. This type is mainly used for static data storage such
as application and system data, user settings, and data files. Data
is stored even after device restarts or powers off.
Reporting
Reporting is the final word about findings. Examiner is responsible to
write an accurate and complete report on findings and analysis of the
digital information and device. In addition to findings and analysis, it is
important to have accurately documented steps taken during all phases of
the investigation.
General suggestions for the information that could be included in the
report is the following (National Institute of Justice, 2004):


Identity of the reporting agency



Case identifier or submission number
37

�

Case investigator



Identity of the submitter



Date of receipt



Date of report



Descriptive list of items submitted for examination, including
serial number, make, and model



Identity and signature of the examiner



Brief description of steps taken during the examination, such as
string searches, graphics/image searches, and recovering erased
files



Results/conclusions

Digital Evidence Collection
Every digital forensic investigator must be aware of the entire context of
digital surroundings and other sources of evidence at the crime scene.
Every digital device, if accessed in an improper manner, can cause data
change and evidence loss. Data can be in form of network connections,
processes, memory data, and data on hard disk or peripheral memory, or
in volatile and non-volatile memory. Data written on mobile device
memory cards, hard drive, and external memory storage can be considered
as static memory or non-volatile, while data written in RAM is considered
as volatile memory.
With this in mind, it is important to distinguish states in which data can
be found. Furthermore, digital forensic investigator has to be careful in
approaching data collection phase.

38

�Computer or other digital devices which are recognised at the crime scene
must be approached with care. Crime scene has to be preserved and
documented using sketches and photos, and if computer or other digital
devices are found, their power status must be checked.
Hard drive data will remain on media after a device is powered off and
that data can be cloned and duplicated. Data in RAM will disappear after
device is turned off. This includes information such as running processes,
network connections, and system settings (Nelson, Phillips &amp; Steuart,
2015). This is the way in which two major approaches have to take care
of live data and post-mortem data acquisition.
Live Data collection
Tools for the acquisition of data in volatile memory can copy data from
volatile memory and transfer it to the forensic location on non-volatile
memory for the later analysis. Data from volatile memory or system can
also be copied with the goal to collect information such as established
sessions, running processes, network processes, passwords, and
connected users.
Live acquisition is done if a digital forensic investigator decides to collect
all available data in volatile memory from the crime scene. Digital
forensic investigator needs to be aware that any access to running system
can change data and destroy evidence on that system.
Data acquired from volatile or non/volatile memory has to be copied or
cloned on a disk which will be used for the forensic analysis. During this
39

�phase, all data dumps must be saved on a separate disk and calculated with
hash functions such as SHA512 to be able to have a guaranteed evidence
integrity. All results from hash calculation such as SHA512 have to be
saved for the later use.
Data that can exist in a volatile memory is the following:
-

Information about running processes, network sessions, and
services

-

Unpacked/decrypted versions of protected programs

-

Running malware/Trojans

-

Cloud service information

-

System information (system uptime, system inventory, etc.)

-

Information about logged in users

-

Registry information

-

Open network connections and content of ARP cache tables

-

Social networks information

-

Online communication (Viber, Skype)

-

History of Web browsing activities

-

Information about an access to Webmail systems

-

Decryption keys for encrypted volumes mounted at the time of the
capture

-

Recently viewed images

Information about running process, open network connections, and
evidence will not remain after the process is completed, which is due to
volatile memory data limitations. However, with types of data such as web
browsing history, online chats will not disappear instantly after the end of
40

�communication. System or its user can overwrite data (Afonin and
Gubanov, 2013).
Post-mortem data collection
Digital device which is powered off is ready for the post-mortem data
acquisition. Only approved tools for data imaging are used for the postmortem forensic data acquisition. For data acquisition it is necessary to
make a clone and perform the forensic analysis with cloned and copied
data while original media stays intact in the safe place with calculated
hash value such as SHA512. Devices which prevent changes on the
original device with data are called write blockers. This type of devices
disables writing on the original storage media. Direct access to disk plates
and memory chips is enabled if a device is damaged. Forensic computer
which has tools and ports able to access external devices with cloned data
is used for accessing data on the cloned disk.
Completeness and accuracy are two critical measurable attributes of the
acquisition process.
While completeness quantifies whether all the data was acquired,
accuracy quantifies the correctness of acquired data.
In order to achieve completeness and accuracy in copying data from the
original source, bit-for-bit copy and bit-stream duplicate data from the
original data source to destination memory location. Bit-for-bit can be
used with specialized tools, while bit-stream can be performed with the
computer (NIST, 2004).
41

�Data concealment
It is not possible to investigate data which is not available and visible to
the investigator. Thus, criminals and wrongdoers employ different
techniques to destroy and hide evidence (Marcella A. J. and Menendez
D., 2008).
Spoliation
Spoliation is an act of destroying or changing evidence with the goal to
make evidence unusable.
Encryption
Encryption is a process of converting data and files into cryptic form so
that data can be accessed only by using passwords for symmetric
encryption and using private and secret keys if asymmetric encryption is
used.
Steganography
Steganography is the process of hiding data such as messages into existing
files which can be textual files, pictures, and video files. Various tools are
being used for performing data concealment in data files.
One of the well-known tools for hiding messages in data files is snow tool
(SNOW, 2019) which uses whitespace steganography practice. This
program is used:
“to conceal messages in ASCII text by appending whitespace to the end
of lines. Because spaces and tabs are generally not visible in text viewers,
the message is effectively hidden from casual observers. And if the built42

�in encryption is used, the message cannot be read even if it is detected.”
(SNOW, 2019)
For the purpose of explaining the process of hiding the text inside the file,
“sample_file.txt” was created with the content shown in Figure 9.

Figure 9. Sample_file.txt content

Issuing snow command with flags –C program snow compresses the data
if concealing, or uncompresses it if extracting the file. (SNOW, 2019)

43

�Figure 10. Creating concealed message in sample_file1.txt content

In Figure 11. it is possible to see content of the new file “sample_file1.txt”
after issuing the type command. Figure 11. also shows in “cmd” editor
that additional space is added but no content is visible.

Figure 11. Creating concealed message in sample_file1.txt content

44

�Figure 12. shows an unsuccessful attempt to read a concealed message
without the password as well as a successful attempt by providing the
password with “-p” flag that is “secret_password.”

Figure 12. Reading concealed message in sample_file1.txt content

To make it harder for the investigators to find concealed data, it is possible
to replace the original with the file which contains a concealed message
by deleting the original file, and renaming the file with concealed message
with an original file name.
Figure 13. shows the size difference between “sample_file.txt” and
“sample_file1.txt.” Due to such calculation of files, hash is the technique
which can be used to detect if somebody, in person or by using a malicious
program, changed the content of the files.

45

�Figure 13. File sizes comparison

Summary
With a goal to successfully present forensic findings, it is necessary to
conduct forensic investigation with care and by the latest forensic
investigation advancements.
Every forensic investigator has to know that suspects can hide data using
different techniques such steganography, encryption, or simply by
destroying data.
It is important to emphasize that before the analysis, data has to be copied.
The preferred action is to clone data from the original media to avoid
deletion of the original data.

Knowledge acquired
Common steps in the digital forensic investigation process that includes
Preservation, Collection, Transport, Examination, Analysis. Essential
knowledge of types of digital evidence analysis that includes Media
analysis, Media management analysis, file system analysis, network
46

�analysis, application analysis, operating system analysis, executable
analysis, image analysis, video analysis.
Memory Analysis, Reporting. Digital evidence collection that includes
Live Data collection Post-mortem data collection and data concealment
methods which can be used such as spoliation, encryption, and
steganography.

Review questions
1. Explain common steps in the digital forensic investigation
process.
2. Name digital evidence collection methods?
3. What is image analysis?
4. What is video analysis?

Further readings
-

Digital transformation: online guide to digital business
transformation https://www.i-scoop.eu/digital-transformation/

-

The Cyber Security Management System: A Conceptual Mapping,
SANS Institute InfoSec Reading Room
https://www.sans.org/reading-room/whitepapers/basics/cybersecurity-management-system-conceptual-mapping-591

Video resources
-

Computer Forensic Investigation Process

https://www.youtube.com/watch?v=NmuhGa4QekU
-

Overview of Digital Forensics

https://www.youtube.com/watch?v=ZUqzcQc_syE

47

�48

�4. Digital forensics – tools

Chapter abstract
Chapter goals: To present forensic tools and explain for what purpose
they can be used in digital forensic process investigation. Digital forensics
covers different technologies and components, hence, different and
specialised digital forensic tools exist, namely for database forensics,
network forensic, and mobile devices.
Learning outcomes: Knowledge of digital forensic tools and how they can
be used.

Digital Forensic Tools
To achieve desired results, scope of the investigation must be defined first.
Defining scope will also determine what the investigator is looking for,
how to reach those locations and information and which tool has to be
used. Concerning forensic tools, there are many ways to reach the same
goal. This section will focus only on Android tools needed to perform the
necessary steps.
49

�Hardware digital forensic tools and their usage
Hardware tools are necessary for accessing data on devices such as hard
drives or mobile devices. One of the most important aims is to clone data
from original digital devices and provide the exact digital copy which will
be used for the investigation.
Usage of hard disk docking stations
Hard disk docking stations should be in the arsenal of every digital
forensic investigator.
This type of devices should be able to access different types of disks which
can be found in laptops, personal computers, and servers. It should also
have the clone function for cloning HDDs without laptop, PC, or server
to prevent losing or changing files of suspects.

Figure 14. Hard disk docking station (Renkforce, 2019)

50

�Usage of memory card docking stations
Many devices such as smart phones, laptops, and CCTV cameras hold SD
memory and other types of memory cards which have to be investigated.

Figure 15. Memory card docking station (Logilink, 2019)

Memory card docking station is used to read data from memory cards
taken from the device.
Usage of Portable Computer Forensic Lab
Figure 16. shows the specialised all-in-one case called Road Master (Road
MASSter 2, 2019).

51

�Figure 16. Portable Computer Forensic Lab Road MASSter 2, 2019

The Road Master is capable of high-speed forensic data acquisition
operations used to access external devices.

Usage of General Computer forensic tools
Different hardware and software tools are used to preserve and collect
crucial data for the forensic analysis process.
Disk Genius usage
DiskGenius is a software with functions able to recover partitions and
make data backups, and it has other disk utilities required for the disk
management.
It can manage storage space, deletion acts, and virus attack; it also has the
formatting function, and recovers data lost due to the disk corruption, etc.,
and it provides the backup to prevent data loss.

52

�Figure 17. Disk Genius

DD command tool usage
Mobile device, computer, or any other digital device found at the crime
scene can be a subject of the post-mortem data acquisition. This is a way
of collecting data information on devices found switched off. Since a
device if off, volatile data in memory is not available, but data stored on
a hard drive/solid memory is a very valuable source of information.
Investigator must make an image of a hard drive or mobile device solid
memory or some other storage devices.
Linux command line dd is used to copy the content of a seized device.
Example of dd usage is: dd if=/dev/sda of=/dev/sdb and it copies the
content from /dev/sda to the /dev/sdb destination.
53

�Busybox usage
Busybox is a toolset based on many UNIX utilities. Utilities are combined
into a small executable. Busybox provides a usable environment for small
or embedded systems. It is very modular, and it is made for limited
resources. Busybox set of commands makes access to the system at a
lower level making environment more accessible. It is available for
download on https://busybox.net/.
Hash Calculation
Calculation of file hashes must be done immediately after the acquisition
of digital information. It ensures the integrity of the collected data. It is
usually a solid memory image or a separate file.
Linux commands used for generating hash values are sha256sum or
sha512sum. SHA256SUM uses 32-bit blocks, while SHA512SUM uses
64-bit blocks.
Figure

18.

is

an

example

of

generating

usb_modeswich.conf file using both generators.

Figure 18. Calculating Hash Value
54

hash

values

of

�Database tools usage
The following passages present tools which can be used for the database
forensic process.
Usage of the Oracle LogMiner
Oracle LogMiner, (2019) is a tool that can be used for digital forensic
investigations.

Figure 19. Q Capture program works with LogMiner to retrieve
changed data IBM Knowledge, Center, 2013

It allows the analysis of changes to be performed in a database, and
provides the rollback function for data including errors made by users.
Figure 20. shows how with LogMiner it is possible to view and save redo
logs, as well as create and execute queries to find specific actions using
GUI. It also shows query for a specific time and database user.
55

�Figure 20. View all transactions for user, Nanda A., 2019

As a result, Oracle LogMiner created an initial report which shows
database user activity.

Figure 21. LogMiner results, Nanda A., 2019

By opening transactions detail, it is possible to see which query a specific
user issued. LogMiner can be used for acquiring data on usage of data
manipulation language (DML) which is a programming language used in
a database for adding (inserting), deleting, and modifying (updating) data.
The goal of using the Oracle LogMiner is to find DML statements for the
post-mortem forensic investigation.
56

�Figure 22. LogMiner results, Nanda A., 2019

LogMiner can be used for an offline analysis of archived redo logs on a
separate database.
Usage of the IBM Guardium Data Protection for Databases
IBM Guardium (2019) Data Protection for Databases is a forensic tool
used to protect database from an unauthorised access. It detects unusual
activities on sensitive data. It provides a real-time monitoring and alerts
on suspicious activities.

Figure 23. IBM Guardium (2019) Navigation Overview

IBM Guardium provides a preventive protection, but it can also be used
for database forensic investigations which need to show if the user or
administrator committed a suspicious or criminal activity.

57

�Figure 24. IBM Guardium (2019) Out of the box creation

Usage of the DB Browser for SQlite
Even small devices such as mobile phone, tablet, or embedded systems
based on Android operating system utilize databases needed for services
they are produced for. Regardless of whether data is structured or
repeating, Android stores data in the SQLite database. SQlite is an
embedded SQL database engine. Unlike other, SQL databases does not
have a separate server process, which means it reads and writes directly
to disk files. The entire database is contained in a single file located on a
disk. Considering that size of the library is approximately 300-500 KB,
and it is made to run under a minimal stack space (4KB) and heap
(100KB), SQLite is ideal for devices struggling with memory space such
as tablets, GPS navigations, MP3 players, etc. It is free for use regardless
of being commercial or a free project.
Since each Android device consists of more databases of this type, for the
forensic investigation, it is helpful to have a tool for a direct access to
database. One of such free tools is DB browser for SQLite shown in
Figure 25.

58

�Figure 25. DB Browser for SQLite

Usage of the Undark - a SQLite data recovery tool
Undark is a data recovery tool for SQLite databases. It is useful to retrieve
deleted data from the database file. Chances to recover a useful set of data
are minimal if database is defragmented and vacuumed. Undark relies on
the fact that actual data is not purged immediately when the process of
deletion started, because there could be active transactions which could
still access the old version of the record. It is rather performed at a later
stage when system does periodical checks for the old data record.
Download is available at GitHub https://github.com/inflex/undark.
Undark capabilities are to:
-

Retrieve most available records from the SQLite database;

-

Deposit actual records;

-

Recover deleted records;
59

�-

Retrieve data from a corrupted SQLite database.

The command to convert the recovery SQLite database broken.db into
recover.csv file format is:
undark -i broken.db &gt; recover.csv

Recover.csv file will be filled with actual and recovered records from
broken.db.

Usage of the SQLite-Deleted-Records-Parser
This is another useful tool used to recover SQLite deleted records. It is
simple to use, but results are valuable in recovering deleted data from an
unallocated

space.

Download

is

available

on

https://github.com/mdegrazia/SQLite-Deleted-Records-Parser.
Command for its usage is:
sqlparse_CLI -p -f source.db -r -o dbreport.txt

Usage of the Network forensic tools
Different network forensic tools can be used, however data and session
traffic have to be captured and stored in order to have all relevant
information available for forensic purposes.
Wireshark usage
Wireshark is a popular tool for capturing and analysis of the network
traffic.
60

�Control Port 21

FTP Client
Data port 20

FTP Server

Figure 26. FTP connection

Figure 27. shows the captured Wireshark traffic for the FTP session
initiation with an entered username and password as an example of how
the unencrypted traffic can be captured for a later analysis.

Figure 27. Captured FTP connection with Wireshark

61

�NIKSUN NetDetector usage
NIKSUN NetDetector (2019) is capable of a dynamic application
recognition, and it has integrated anomaly and signature-based IDS, data
leakage prevention, real-time surveillance and application, and session
reconstruction. NetDetector web site is the following:
https://www.phoenixdatacom.com/product/niksun-netdetector-packetcapture-network-security-forensics/

Figure 28. NIKSUN NetDetector, 2019

Xplico usage
Network forensic tool Xplico is an open source software used for the
analysis of network sessions. Xplico web site is https://www.xplico.org/

62

�Figure 29. Xplico (2019)

Usage of the Mobile device forensic tools
General forensic tools for computer system and database tools can be used
to perform the forensic analysis of mobile devices.
Rooting Tools usage
Investigator needs to decide what type of rooting needs to be performed,
with or without a computer. Whatever the choice is, it should produce the
same result, which is for a device to be rooted. However, a higher success
rate is expected for the computer driven process. If a device needs to be
rooted without the computer, a special crafted apk package needs to be
downloaded and installed directly to the Android device. Very commonly
used tool to root over the computer is Kingo Root (Figure 30).

63

�Figure 30. Kingo Android Root

If the rooting process needs to be performed without the computer, then
this task can be done with an application named TowelRoot. Software can
be downloaded at https://towelroot.com/
Santoku usage
Santoku is a Linux based platform used for various security related
activities. Operating system comes with the pre-installed platform
Software Development Kits (SDK), drivers, and utilities.
Santoku auto-detects and sets up new connected mobile devices, saving
time for investigation tasks. A graphic User Interface (GUI) tool makes
an easy deployment and takes control of mobile applications and
investigation tools as shown in Figure 31.

64

�Figure 31. Santoku Linux

The installation is free for download at http://santoku-linux.com (Figure
32), and the platform can be installed on hardware or in the virtual
environment.

Figure 32. Santoku Linux Download

65

�The main aim of Santoku platform is:
Mobile Forensics
Tools to acquire and analyse data
Firmware flashing tools for multiple manufacturers


Imaging tools for NAND, media cards, and RAM



Free versions of some commercial forensic tools



Useful scripts and utilities specifically designed for mobile
forensic

Mobile Malware
Tools for examining mobile malware
-

Mobile device emulators

-

Utilities to simulate network services for dynamic analysis

-

Decompilation and disassembly tools

-

Access to malware databases

Mobile Security
Assessment of mobile applications
-

Decompilation and disassembly tools

-

Scripts to detect common issues in mobile applications

-

Scripts to automate decrypting binaries, deploying apps,
enumerating app details, and more.

66

�AF Logical OSE usage
AFLogical OSE is an open source tool used for a simple logical
acquisition of data from the Android device. It can be found already
compiled in Santoku Linux distribution (Figure 33).

Figure 33. AFLogical OSE

Autopsy and the Sleuth Kit usage
The Sleuth Kit is an open source digital forensic set with the collection of
command line tools. Autopsy is a graphical interface (Figure 34.) for the
Sleuth Kit and it provides an easy usage of available tools. It also provides
case management, image integrity, keyword searching, and other
operations without the need for an external software.

67

�Figure 34. Autopsy Main Operations Screen
Image Import and Supported Image Formats

Autopsy can analyse raw, dd, or E011 format of disk images and local
drives, or a folder of local files. Before the analysis, investigator is
required to choose which type of data source is the source of information
(Figure 35.). Forensic investigator can select Disk Image or VM File
obtained with available methods, attached Local Disk, already prepared
Logical Files, or Unallocated Space Image. It is possible to use a file taken
out of the disk image section for an additional investigation.

1

The popular commercial forensic suite, EnCase, developed a proprietary format called EnCase Evidence
File format. EnCase Evidence Files use the file extension, E01, and are based on the Expert Witness Format
(EWF) by ASR Data (Forensicwiki, 2012). These image files are commonly referred to as Expert Witness,
E01 or EWF files.- (https://www.sans.org/reading-room/whitepapers/forensic/forensic-images-viewingpleasure-35447,10.1.2018)

68

�Figure 35. Type of Data Source
Analysis Features

Below is the list of Autopsy features.


Multi-User Cases: Collaborate with fellow examiners on large
cases.



Timeline Analysis: Displays system events in a graphical interface
to help identify the activity.



Keyword Search: Text extraction and index searched modules
enable you to find files which mention specific terms and find
regular expression patterns.



Web Artefacts: Extracts web activity from common browsers to
help identify user activity.



Registry Analysis: Uses RegRipper to identify recently accessed
documents and USB devices.



LNK File Analysis: Identifies shortcuts and accessed documents.



Email Analysis: Parses MBOX format messages, such as
Thunderbird.

69

�

EXIF: Extracts geo location and camera information from JPEG
files.



File Type Sorting: Group files by their type to find all images or
documents.



Media Playback: View videos and images in the application and
there is no need for an external viewer.



Thumbnail viewer: Displays thumbnail of images to help view
pictures quickly.



Robust File System Analysis: Support for common file systems,
including NTFS, FAT12/FAT16/FAT32/ExFAT, HFS+, ISO9660
(CD-ROM), Ext2/Ext3/Ext4, Yaffs2, and UFS from The Sleuth
Kit.



Hash Set Filtering: Filter out good known files using NSRL and
flag bad known files using custom hashsets in HashKeeper,
md5sum, and EnCase formats.



Tags: Tag files with arbitrary tag names, such as 'bookmark' or
'suspicious', and add comments.



Unicode Strings Extraction: Extracts strings from an unallocated
space and unknown file types in many languages (Arabic, Chinese,
Japanese, etc.).



File Type Detection is based on detection of signatures and
extension mismatch.



Interesting Files Module will flag files and folders based on name
and path.



Android Support: Extracts data from SMS, call logs, contacts,
Tango, Words with Friends, and more. (The Sleuth Kit, 2018)

70

�Ingest Module usage
Ingest Module is a very helpful and powerful feature. During the initial
case setup, it offers selection of needed ingest modules as shown in Figure
36. It identifies files and extracts known data as records. Examples of
those records are emails, SMS messages, etc. Analysis of time and disk
space may vary depending on how many modules are selected. It is
important to have an Android Analyser module selected if an Android
device image is an object of the import.

Figure 36. Autopsy Ingest Module
71

�Android Analyser module usage
This module helps identify files and present data containing contacts,
messages and other communications records, web history, web
bookmarks etc. It gives an option to manually tag findings for different
types of categories such as Child Exploitation. Figure 37. shows which
types of categorization can be found on the main screen.

Figure 37. Android Analyzer
72

�Accessing Partitions
Beside an automatic search for interesting records, it is possible to access
image partitions manually. This offers another view to the acquired data,
having a flexible approach to the offered data structure. Figure 38. shows
all partitions acquired by the physical acquisition.

Figure 38. Access to Imaged Partitions

73

�Timeline
Timeline option offers a powerful overview of the recorded events in time
domain. With filtering options, timeline makes context building in View
Mode Counts easier (Figure 39.).

Figure 39. Timeline – View Counts

Colours represent main types of event categories, File System, Web
Activity, and Misc. Types (Figure 40.). This filter is useful when many
events are presented, thus allowing the focus on the interesting ones.

74

�Figure 40. Filter Events Categories

When the View Mode is set to Details, it is possible to see and pin a
potential interesting event. Figure 41. shows SMS and pinned messages.

Figure 41. Timeline - View Details
75

�Reporting
Autopsy offers an option of generating reports in various formats (Figure
42.). The final report will include either all analysis results or only tagged
ones. When a large amount of data is generated, Excel format report gives
more flexibility in case that data needs to be exported further.

Figure 42. Report Formats

Generated report is filled with the case summary as shown in Figure 43.

76

�Figure 43. Report - Case Summary

Figure 44. Report - Tagged Images

Figure 44. shows a detailed list of Keyword Hits, Tagged Files, Tagged
Images, or Tagged Results.

Summary
Cyber security is a subset of information security which deals with the
security of information stored in a digital form and transferred over
77

�communication links. A great part of information security related
standards deals with cyber security issues.
Almost daily, media reports reveal cyber security related incidents. After
the historical analysis, we can conclude that we will see an increase in the
frequency of incidents of this type, especially as more services and users
use digital technology in their everyday work and life.

Knowledge acquired
Digital forensics – tools and usage: of hard disk and memory card docking
stations, Portable Computer Forensic Lab, usage of general computer
forensic tools such as
Disk Genius usage, DD command tool usage, Busybox usage. Database
tools usage such as the Oracle LogMiner, IBM Guardium Data Protection
for Databases, DB Browser for SQlite, Undark - a SQLite data recovery
tool, SQLite-Deleted-Records-Parser. Usage of the network forensic tools
such as Wireshark usage, NIKSUN NetDetector, Xplico usage. Usage of
the mobile device forensic tools such as Rooting Tools usage, Santoku
usage, Autopsy and the Sleuth Kit, Ingest Module usage, Android
Analyser module and how to access partitions and use reports.

Review questions
1. Explain the difference between digital forensics tools.
2. Name tools for each technology?
3. Steps for mobile forensic investigation.

78

�Further readings
-

Digital transformation: online guide to digital business
transformation

https://www.i-scoop.eu/digital-transformation/
-

United States Secret Service:

Best Practices for Seizing Electronic Evidence
http://www.forwardedge2.usss.gov/pdf/bestPractices.pdf
-

National Institute of Justice:

Forensic Examination of Digital Evidence: A Guide for Law
Enforcement
https://www.ncjrs.gov/pdffiles1/nij/199408.pdf
-

National Institute of Justice:

Electronic Crime Scene Investigation: A Guide for First Responders,
Second Edition
https://www.ncjrs.gov/pdffiles1/nij/219941.pdf
-

National Institute of Justice:

Electronic Crime Scene Investigation: An On-the-Scene Reference for
First Responders
https://www.ncjrs.gov/pdffiles1/nij/227050.pdf
-

National Institute of Justice:

Digital Evidence in the Courtroom: A Guide for Law Enforcement and
Prosecutors
https://www.ncjrs.gov/pdffiles1/nij/211314.pdf
-

Department of Justice:

Searching and Seizing Computers and Obtaining Electronic
Evidence in Criminal Investigations
79

�-

http://www.justice.gov/criminal/cybercrime/docs/ssmanual2009.
pdf

Video resources
-

Disk Imaging/Acquisition Using Linux DD / DCFLDD command

https://www.youtube.com/watch?v=aJp7_OVW2FA
-

Computer Forensics: fdisk and dd

https://www.youtube.com/watch?v=nzRo8gh7wkA
-

Creating a Disk Image for Forensic Analysis

https://www.youtube.com/watch?v=zY1rblisrBQ
-

Starting a New Digital Forensic Investigation Case in Autopsy 4

https://www.youtube.com/watch?v=WB4xj8VYotk
-

Processing and analysis of disk images with Autopsy 4 default
modules

https://www.youtube.com/watch?v=FJqoUakfmdo
-

80

NIKSUN Netdetector https://niksun.com/notebook.php

�5. Simulation of digital forensic cases

Chapter abstract
Chapter goals: To present digital forensic investigation cases which deal
with the general computer, smart and mobile phones, and databases. To
provide an insight into real forensic investigation processes not limited to
single technology or a tool.
Learning outcomes: Knowledge of the possible ways in which digital
forensic cases can be performed explained in different case simulated
scenarios offering students a real hands-on experience from presented
cases.

Case 1: Forensic data recovery of files on PC
The goal of the forensic investigation was to find a specific file on a disk
on which windows quick-format was performed. There was no need to
acquire live data for this process, because disk had already been removed
from the PC.
81

�For this purpose, Disk Genius was first used together with the hard disk
docking station to clone the original disk to the investigation disk, and
then to copy cloned data to the local investigator’s forensic station.

Figure 45. Disk Genius access to the investigated hard disk

Figure 46. shows how data was copied from the cloned hard disk to the
local forensic investigator PC. All folders and files were available and
needed file was easy to find.

82

�Figure 46. Disk Genius data copy

83

�Case 2: Forensic investigation of Viber, VOICE CALL, SMS,
and Coco on an Android mobile device

While working with the law enforcement team as contractors, we came
across the case of two harassed persons. They were under the pressure
because they were harassed over digital channels such as Global System
for Mobile Communications (GSM) call, SMS text message, Viber
message and threatening photographs, and Coco messenger. Both of them
showed their Android smartphone devices with disturbing content.
Everything was documented in the file.
Local police arrested the suspect and seized his Android mobile phone
while following all the rules and procedures. The android mobile device
was labelled and shielded against the radio frequency radiation, thus
isolating the source of evidence, and transported to the laboratory.

Defining the Scope of the Investigation
Scope definition presents an important factor of the investigation. The
initial interview with reporting persons discovered some basic
information about events such as date and time, content, digital channel
etc.
Seized device in this particular case was Lenovo A2020a40 running
Android operating system version 5.1.1 equipped with GSM SIM card
+38761078857. Device did not have any external storage, nor was it

84

�locked or encrypted. USB debugging was enabled. Team collected all
available information from the first victim (referred to as person 1).
TABLE 2. Reporting Person 1 Data
Report 061abcdef
SMS message
Viber message
Viber photo

Content
Hi beauty, I saw you yesterday.
I’m in love with you.
Picture of message “Are you afraid
of the night?”

Viber call

Date time of receipt
3.2.2018 15:23
2.2.2018 10:27
3.2.2018 15:29
2.2.2018 10:31 duration 63 sec

Team also collected all available information from the second victim
(referred to as person 2).
TABLE 3. Reporting Person 2 Data
Report 062342097
GSM Voice call

Content
-

Coco message
Coco message

Careful with your door lock
You promised me not to leave me alone.
Now you will regret.

Date time of receipt
2.12.2017 11:37 duration 30
seconds
3.2.2018 15:25
2.2.2018 10:42

Both victims experienced unpleasant calls, messages, and photographs
delivered over:


Traditional voice GSM service



Traditional SMS GSM service



Viber Internet service



Coco Internet service

First of all, it was necessary to search for the evidence on the seized
Android device without knowing whether or not potential digital artefacts
were deleted. After an additional analysis, decision was made to search
for database files and photographs in both spaces – allocated and
especially unallocated – because it was assumed that perpetrator deleted
85

�all or some of the messages/calls/photographs. Goal was to find as much
evidence as possible against the attacker.

Preparing the Environment for the Data Acquisition
Workstation dedicated for the investigation must be equipped with
hardware and software needed for the image acquisition. Depending on
the type of image data acquisition, some prerequisites must be met.
Communication interface for the object of the investigation needs to be
ADB connected over the USB port. Since this scope is limited to gathering
logical images, some additional steps must be performed beforehand.
-

Verifying ADB interface

-

Root the device

-

Install Busybox set of utilities

Verifying ADB Interface
The installed ADB connector will act as a link between the workstation
and device, and it will be shown in a device manager as presented in
Figure 47. If there is a malfunctioning issue, it will be shown at this point.

86

�Figure 47. ADB Driver Verified; Android Device Connected

Rooting the Device
Device rooting is needed in order to obtain privileges for the full access
to a system, or a non-volatile memory landscape. This step is critical to
get root privileges for forensic activities. Process requires to:


Connect device to USB



Start the rooting tool

When the Android device is connected to the workstation, it will appear
in a tray (Figure 48.), as well as in device manager under control panel.

Figure 48. Android Device Connected

87

�In order to check adb connection, it is necessary to start the command
ADB DEVICES from the following location:
C:\Users\&lt;username&gt;\AppData\Local\Android\sdk\platform-tools
This is the location where platform tools with adb utility are installed.
Figure 49. shows that workstation has been successfully communicated
with the mobile device named 8d62f4b5.

Figure 49. Successful Communication to Mobile Device over ADB

Before using rooting tools, some precautions must be taken. Rooting is a
powerful process and it can lead to a damage of phone and/or evidence. If
the rooting process is used under normal circumstances, then it
immediately leads to the warranty void. Antivirus and firewall setup can
interfere with normal operations. Checking and testing connection should
be done before the usage.
88

�Starting tool for rooting will show the basic data. Introduction screen
shows data about the device and the start button (Figure 50.). If the device
is recognized, then the process can be initiated by pressing the “root”
button.

Figure 50. Lenovo Rooting Start

Progress will last for a couple of minutes and will be shown in the
application. During the process, device screen will display the status of
rooting (Figure 51.).

89

�Figure 51. Device Status During Rooting Process

When the process is successfully completed, the message “succeed” will
appear. Each brand has its own supporting software, but there are many
other applications used for root checking, one of which is the
RootChecker.

90

�Figure 52. Lenovo Moto Smart Assistant Device Status

Lenovo Moto Smart Assistant was used to check the status of the device
(Figure 52.).
Busybox Sideloading
Since Android is a Linux-based operating system, it is quite useful to have
it installed on your device. After checking the adb connection to device,
it is necessary to place the .apk busy box file (ru.meefik.busybox_34.apk)
within the folder /android-sdk/platform-tools. Adb is available in the same
location.
In order to sideload the application, run the following command in
command line (Figure 53.):

91

�Adb install ru.meefik.busybox_34.apk

Figure 53. Sideloading BusyBox Over ADB

In order to check if the installation was properly completed, type busybox
in the device shell to see whether it starts (Figures 54, and 55.). Available
commands will be listed.

Figure 54. Starting Busybox

In order to use command SHA1SUM from Busybox toolset to calculate
hash value of the file ueventd.rc, type #busybox sha1sum ueventd.rc
(Figure 55.).

92

�Figure 55. Testing Busybox Tool Sha1sum

Determining Partitions and Blocks
Since Android is a Linux-based operating system, partitions are organized
in the same way as every other Linux OS. Knowledge of partitions, names,
and mount points is necessary in order to get to the right place and
determine the source of data before the imaging process begins. A simple
command to list partitions is:
adb shell – to get to the andoid device
cat /proc/partitions

Running these commands will give an overview of what is happening on
the partition level, thus, helping understand which block belongs to which
partition name (Figure 56).

93

�Figure 56. Android Block Names

Another way to obtain information about dev block names is adb shell
ls –la /dev/block/platform/7824900.sdhci/by-name

7824900.sdhci is not a common name for all devices, because it varies. It
is also the subject of the investigation.
Running the command stated above will show results with more familiar
names (Figure 57.).
During the imaging process it is important to decide which blocks will be
captured and transferred. Usually a whole memory landscape (mmcblk0)
is captured and transferred, however, in some special occasions only a
single block might need imaging (e.g. mmcblk0p2). Names may vary, and
they are subjects of device examination.

94

�Figure 57. Android Partition Names and Blocks

Acquiring Data from the Evidence Device
Data from a device will be acquired by applying two methods, namely
Physical and Logical data acquisition.
Logical data acquisition
To start the acquisition, Android device must have a debugging option
enabled, and working adb. From the Linux command line start the
command: aflogical-ose and then enter sudo password (Figure 58.).

95

�Figure 58. Starting AFLogical OSE acquisition

Before pressing Enter to pull data on the device, it is necessary to mark
interesting logs for acquisition, and then press the “capture” button
(Figure 59.).

Figure 59. Device Capture Options

Data is transferred to the remote folder with data packed in a comma
separated value format (Figure 60).

96

�Figure 60. AFLogical OSE Data Extraction and Transfer

Acquired data can be found in folder /home/nera/aflogical-data/ (Figure
61).

Figure 61. Acquired Data in Remote Folder
97

�Data in this folder shows only what logically exists in the phone records
regarding logs we were offered, and which we selected during the initial
logical acquisition step. Deleted records are not available.
Physical data acquisition
In this process, the imaging command of the /dev/block will be issued and
at the same time the transfer over adb link using redirection will be
initiated. Netcat utility will allow forwarding commands across the adb
link.
For the imaging process, Linux command dd will be used. Syntax is:
dd if=/mountpoint of=/destinationpoint/partitiontype
of – Output can be redirected thru netcat (nc) to remote file
dd if=/mountpoint | busybox nc –l –p portnumber

Obtaining data from the source device will be done through two opened
concurrent shells in Santoku investigative workstation (Figure 62.). This
process can take some time. In this case, 7818182656 bytes were
transferred in 7836.341 seconds (approximately 130 minutes).
Remote destination should have enough storage to receive an image.
Another important factor is the type of file system being formatted.
FAT32 will not be able to accept a file larger than 4GB.

98

�Shows if there is a device present at
the adb connection. If the device is
present and communication
successful, the name will appear. In
this case device with name 8d62f4b5
is present.

This command setup is forwarding host port 6970
over TCP protocol to remote device port 6970
over TCP (in this case this is the receiving side
Santoku Linux – investigative workstations
waiting at the SHELL 2)

su command on the remote
shell
Initiate the remote shell to
the only connected device
Copying content /dev/block/mmcblk0 to remote
destination port 6970 using BusyBox Netcat utility

Nc is command used to start NetCat utility to transfer data. In this case netcat is
receiving data from previously started transfer of mmcblk0 block using dd command on
port 6970. Received content will have the name Digital_Evidence_Android_01.dd. This
image will be used later, first to calculate the hash value, and then for the forensic
analysis.

Figure 62. An integrity of the evidence image file

In order to maintain integrity check of the obtained image file, hash
calculation has to be performed and documented (Figure 63.). Calculated
hash value is checked through the entire process, and complete life cycle
of evidence.
Command issued in the shell is:
99

�Sha256sum Digital_Evidence_Android_01.dd

Figure 63. Calculating Hash Value of the Evidence Image

Importing Image File into Autopsy
Before the analysis starts, collected image file needs to be imported into
tool Autopsy 4.5.0. This process can take a while depending on a size of
the image file. During the image collection process, dd command is used
to collect the whole image of Android device including unallocated space
for allowing a deeper analysis. During the initial case creation, option
Disk Image or VM File was chosen as a data source. Ingestion module is
left with default settings fully marked with all available options.

Analysis of the Acquired Mobile Device Data
Data acquired with both methods logical and physical will be the subject
of the investigation.
Analysis of Logically Acquired Data
Logical acquisition is simple, and all data acquired from the phone is
located in one folder with names which correspond to data (Figure 64).
100

�Figure 64. Files Containing Acquired Data

Figure 65. shows the content of the file SMS.csv.

Figure 65. Content of SMS File

CallLog Calls.csv file contains data about calls. Corresponding records
are found in the listing. Figure 66. shows that call is made to number
062342097, date is formatted as EPOCH 2 date time format, and
1512211049405 is 2.12.2017 11:37:29.405., with duration of 30 seconds.

Figure 66. Content of CallLog Calls File
2

The Unix epoch (or Unix time or POSIX time or Unix timestamp) is the number of seconds that have
elapsed since January 1, 1970 (midnight UTC/GMT).

101

�None of the other applications’ log data was retrieved during the logical
acquisition using AF Logical OSE tool. Other matches except voice call
were found (Table 3. and Table 4.).
TABLE 4. Overview of Logically Acquired Data for Reporting Person 1
Report
061abcdef
SMS message
Viber message
Viber call

Viber threating
photo

Content

Date/time of receipt

Hi beauty, I saw you
yesterday.
I’m in love with you.

3.2.2018 15:23

Evidence/Logical
acquisition found
NO

2.2.2018 10:27
2.2.2018 10:31 call duration
1:03 sec

NO
NO

3.2.2018 15:29

NO

Picture of the
message “Are you
afraid of the night?”

TABLE 5. Overview of Logically Acquired Data for Reporting Person 2
Report 062342097

Content

Date/time of receipt

GSM Voice call

-

Coco message

Careful with your door
lock
You promised me not to
leave me alone. Now
you will regret.

2.12.2017 11:37 duration
30 seconds
3.2.2018 15:25

Coco message

2.2.2018 10:42

Evidence/Logical
acquisition found
YES
NO
NO

Analysis of the Physically Acquired Data
Physical analysis begins with the Autopsy tool first. Full Android mobile
device image Lenovo_Android05 is imported and ingest module runs on
data with task configured at the beginning. Autopsy also searches
unallocated space. It could particularly be interesting in case of hiding
data or recovering deleted data.

102

�Autopsy mounted 35 partitions (Figure 67.). Partition vol34 – userdata is
the place where all applications hold data.

Figure 67. Autopsy Mounted Partition from the Evidence Image

Table 6. lists collected information about applications in the scope of
investigation.
TABLE 6. Collected Data about Applications in Investigation Scope
Application
name
Viber
SMS
Coco msg/voice
GSM Telephone
dialler

Location of application

Location of database

/data/com.viber.voip

/data/com.viber.voip/databases

/data/com.android.provi
ders/telephony
/data/com.instanza.coco
voice
/data/com.android.provi
ders.contacts

/data/com.android.providers/telephony/
databases
/data/com.instanza.cocovoice/databases
/data/com.android.providers.contacts/da
tabases

Database
names
Viber_mess
ages
Mmssms.db
59317329_c
oco.db
Contacts2.d
b

103

�Viber Message and Call Investigation
Viber investigation searched for evidence to match data from the table
from the beginning of the case. The goal was to prove the existence of
digital trail related to Viber. Table 7. shows receiving report from user
061abcdef.
TABLE 7. Viber Message and Call Investigation
Report 061abcdef
Viber message
Viber call

Content
I’m in love with you.

Date/time of receipt
2.2.2018 10:27
2.2.2018 10:31 call duration 1:03 sec

Viber threating photo

Picture of message “Are you
afraid of the night?”

3.2.2018 15:29

First of all, we need to locate the proper partition and data path, found in
the Viber database (Figure 68.).

Figure 68. Viber Database Location and Metadata
Searching for the Viber Message – “I’m in love with you”

104

�In order to find the message, database needs to be extracted to the
operation folder (right click on database – extract) and then opened in DB
Browser for SQLite.
Viber database structure is shown in Figure 69. Tables messages and
messages_calls will be the subject of analysis because they contain data
interesting for the investigation.

Figure 69. Viber Database Structure

Executing an SQL command over the table messages in database
viber_message will yield results which is proof that the message “I’m in
love with you” was sent from the phone (Figure 70.). Epoch data
1517563641920 is 2.2.2018 10:27:21.920

105

�Figure 70. Retrieve Data About Message from Table Messages
Searching for the call 2.2.2018 10:31; call duration 1:03 sec

The following step is to find the trail for Viber call to 061abcdef on
2.2.2018 at 10:31; call duration 1:03 sec. Table message_calls contains
data. Executing an SQL command with parameters needed to narrow
query will return data which is a proof that the call was made from this
phone (Figure 71). Epoch time 1517563892604 is equal to 2.2.2018
10:31:32.604.

106

�Figure 71. Retrieve Data About Calls from Table Messages_Calls
Searching for the sent picture of the message “Are you afraid of the night?”

The following task is to find the Viber picture/photo of the threatening
message “Are you afraid of the night?” sent 3.2.2018 at 15:29.
Table messages in viber_messages database shows the record of a deleted
message (Figure 72.).
Other than date/time value and status of the message, other available data
is not in the scope of the investigation.

Figure 72. Viber Database Records
107

�Epoch 1517668146820 is 3.2.2018 15:29:06.820 which corresponds to
date and time from the initial search table. Another step is to search
unallocated space for deleted pictures. Autopsy has a strong engine
inspecting files according to the ingest module configuration. Picture was
found as a deleted file (Figure 73.).

Figure 73. Recovered Deleted Picture

Additional data about the file is shown in Figure 74.

108

�Figure 74. Recovered Deleted Picture Metadata

SMS Message Investigation
The scope of this investigation is database where SMS messages are
stored. Initial data we were searching for is shown in Table 8.
TABLE 8. SMS Message Investigation
Report 061abcdef
SMS message

Content
Hi beauty, I saw you yesterday.

Date/time of receipt
3.2.2018 15:23

According to the previous mapping of the application location, SMS
messages

are

stored

in

database

mmssms.db

located

in

/data/com.android.providers/telephony/databases. After the process of
database extraction to the operational folder, the examination of the
database structure is performed (Figure 75). Table named sms should have
data about messages. Other tables were opened, and attributes were
checked. Depending on the scope of the investigation, some other tables
can be subject to a detailed analysis.

109

�Figure 75. MMSSMS Database Structure
Searching for the sms message “Hi beauty, I saw you yesterday”

No other tables except the sms table contained the needed records. The
investigation shows that records in table sms do not contain data about the
message “Hi beauty, I saw you yesterday” (Figure 76.). It is assumed that
the message is deleted from the database because executed SQL
commands do not retrieve any data on setup condition. Other tools should
be used to perform the possible data recovery at database level.

110

�Figure 76. Retrieve Data about Calls from Table SMS

SQLite-Deleted-Records-Parser tool could help determine deleted data in
database. Start tool with mmssms.db and output file mmssms.txt. After
that, the execution message is found in unallocated space (Figure 77.).

111

�Figure 77. Recovered Deleted Database Record

GSM Voice Call Investigation
The scope of the GSM voice call investigation will be database where data
records are stored. Initial data we were searching for is shown in Table 9.
TABLE 9. GSM Voice Call Investigation
Report
062342097
GSM Voice
call

Content

Date time of receipt

-

2.12.2017 11:37 duration 30 seconds

Voice call log records can be found in database contact2.db located in
/data/com.android.providers.contacts/databases. The structure of database
after the extraction to the operational folder is shown in Figure 78.

112

�Figure 78. Contact2 Database Structure
Searching for the GSM voice call 2.12.2017 11:37 duration 30 seconds

Table calls should have data related to executed call, incoming as well as
outgoing call. Executed SQL command retrieves data about call dated in
the table at the beginning of the investigation (Figure 79.). Epoch
1512211049405 is 2.12.2017 11:37:29.405.

113

�Figure 79. Retrieve Data About Calls from Table Calls

Coco Message Investigation
Coco messenger is not a widespread application. It supports messaging
and voice communication. According to the previous analysis and
application location mapping database, 59317329_coco.db is located in
/data/com.instanza.cocovoice/databases. Initial data we were searching
for is shown in Table 10.
TABLE 10. Coco Message Investigation
Report
06234209
7
Coco
message
Coco
message

Content

Date/time of receipt

Careful with your door lock

3.2.2018 15:25

You promised me not to leave me alone. Now
you will regret.

2.2.2018 10:42

The structure of database after the extraction to the operational folder is
shown in Figure 80.

114

�Figure 80. 59317329_coco Database Structure
Searching for the message “Careful with your door lock”.

Table ChatMessageModels should have data related to messages.
Executed SQL command did not have any data about the message (Figure
81).

115

�Figure 81. Retrieve Data about Chat Message from Table Content

SQLite-Deleted-Records-Parser tool retrieved deleted database data from
the source file database 59317329_coco.db and output file coco.txt. After
that, the execution message was found (Figure 82).

Figure 82. Recovered Evidence Message from Deleted Database Record
116

�Searching for the Message “You promised me not to leave me alone. Now you
will regret.”

Table ChatMessageModels should have data related to messages.
Executed SQL command retrieved data about the call dated in the table at
the beginning of the investigation (Figure 83). Epoch 1517564524520 is
2.2.2018 10:42:04.520.

Figure 83. Retrieve Data about the Message from Table Content

Investigation Findings
The investigation was completed by summarizing discovered digital
artefacts on the perpetrator’s Android mobile device. Quantitative data is
shown in Table 11.
TABLE 11. Quantitative Data about Found Evidence

Viber
SMS
Coco
GSM calls
Total
Percentage

Number
of
reported/expected
digital artefacts
3
1
2
1
7
100%

Logically
acquired
artefacts
0
0
0
1
1
14.2%

Physically
acquired
artefacts
3
1
2
1
7
100%
117

�Summary of data shows that the team proved the existence of the searched
data in the mobile device. Investigation started with 7 reported
messages/calls/photos. That was the foundation for defining the scope of
the investigation and tools needed to carry it out. During processes, two
methods of data acquisition were used, namely Logical and Physical data
acquisition. It is obvious that using AF Logical OSE tool for the logical
acquisition was not enough to obtain the necessary data – especially when
data was deleted (SMS) – and other Internet services such as Viber and
Coco messenger and deleted photographs.

Ending Investigations
All collected evidence findings were submitted according to the rules and
procedures. The report is handed over to the authorities together with the
evidence. The evidence was used in the court. It is not known what
happened to the perpetrator.
Figure 84. shows the report summary with data about case such as case
name, case number, examiner name, time zone, and the location of the
taken image.

118

�Figure 84. Report Summary

Figure 85. shows tagged files for evidence. Evidence list contains the
exact location of evidence within the partition.

Figure 85. Report of the Evidence Tagged Files and Locations

Report navigation offers grouping of data by categories of keywords hits,
tagged files, tagged images, and tagged results. The report showed in
Figure 85. included files and images as the evidence trail.

119

�Case 3: Database forensics – user complaints on high bills
The complain centre in the Internet provider’s company received the
complaint from the customer about high bills at the end of the month.
Management ordered forensic analysis, so internal forensic investigators
began the forensic analysis on the RACUNI_USER_USER table where
customer account details were kept to investigate the potential suspicious
activity. The forensic analysis of the table RACUNI_USER_USER
should indicate if there was an unauthorized change, and if yes, when and
who did the changes.
The report with IBM Guardian was created for the given table, and the
result of the report is shown in Figure 86.

aaa.bbb.cc.dd

aaa.bbb.ii.jj

aaa.bbb.cc.dd

aaa.bbb.ii.jj

aaa.bbb.cc.dd

aaa.bbb.ii.jj

Figure 86. IBM Guradium report for the customer complaints

The report shows details indicating that there has been a change in the
table, that is, in the set values for MOBILE, FIXED for two customers and
INTERNET for one customer. We can notice that DB USER is an
unclassified person (attacker) who came from the IP address:
aaa.bbb.cc.dd where the service account ESJEDNICE_TST was logged
on.

120

�By inspecting a HOST that corresponds to an IP address, it was confirmed
that it is a file server of the Internet provider company (BH TELECOM)
domain.
aaa.bbb.cc.dd
aaa.bbb.cc.dd
aaa.bbb.cc.dd
aaa.bbb.cc.dd
aaa.bbb.cc.dd

Figure 87. IP resolution

Digital forensic investigators detected a criminal attempted to conceal
evidence by logging in with a service account on the FILE server. Attacker
used the file server to start SQLPLUS tool with the user ATTACKER to
access the database and make unauthorized changes in the table.
The next logical step in the forensic investigation was to try to find out
who was hiding behind the username ATTACKER, or who gave the rights
(rights to the database) to the ATTACKER who made the changes in the
table. Information is presented in Figure 88.
aaa.bbb.ii.jj

aaa.bbb.ii.jj

aaa.bbb.ii.jj

aaa.bbb.ii.jj

aaa.bbb.ii.cc

aaa.bbb.ii.jj

aaa.bbb.ii.cc

aaa.bbb.ii.jj

aaa.bbb.ii.cc

aaa.bbb.ii.jj

aaa.bbb.ii.cc

aaa.bbb.ii.jj

Figure 88. Report from IBM Guardium shows ATTACKER creator

121

�User ATTACKER was created by one of the administrators
(MIRZA_ADMIN) through SQLPLUS on a local server, and granted
through the Oracle Enterprise Manager Tool.

Case 4: Database forensics – Salaries data leakage
Company management initiated the forensic analysis after salary details
were revealed in the media. Due to disclosure of the confidential
information, a written request from the management was made to conduct
a detailed forensic investigation of the database to determine who and how
accessed the table with data about salaries. Fact known by forensic
investigators was that there were two tables containing the incriminated
data. One table contained data on salaries and another on employee names.
The next report in the IBM Guardium tool, which follows the sensitive
tables, shows the events related to this case (Figure 89.).

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

Figure 89. IP address, username, and SQL command

The first report shows that the undefined user POM_2015 connected to
the database using the SQLPLUS tool, from the machine whose IP address
is: aaa.bbb.ee.ff where the user is esjednice_stst1, and created tables with
contents of the table PLATE (SALARIES) and UPOSLENIC_FIRME
(COMPANY_EMPLOYEES).

122

�Figure 90. shows DNS name of PC with address aaa.bbb.ee.ff which
determines PC ucionica (classrom1). This is an example of a fraudulent
activity where the HOST classroom1 is used to hide database access
traces. Another important issue is that the access to tables with salaries
and table with names was not direct. Rather, in order to cover tracks, two
so-called “help tables” were created (IZVJ_2015 and HR_IZVJ_2015)
with data from sensitive tables.
aaa.bbb.ee.ff
aaa.bbb.ee.ff
aaa.bbb.ee.ff
aaa.bbb.ee.ff
aaa.bbb.ee.ff

Figure 90. IP Address name resolution

From the fact that two additional tables were created for sensitive data
access, we can understand that the attacker assumed that there were
certain tools which followed the access to the above tables, and tried to
obtain data from sensitive tables indirectly. The next step for the forensic
team was to go into a deeper analysis of user POM_2015 and tables
created by this user which indicated illegal activities on the database.

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

aaa.bbb.ee.ff

aaa.bbb.ii.jj

Figure 91. View detailed POM_2015 user-related activities
123

�Figure 91. shows the chronological overview of the user POM_2015 and
administrator MITZA_DBA criminal activities on the database. After
POM_2015 created the auxiliary tables from which s/he collected the
information, s/he wiped it out to cover up the evidences. However, the
IBM Guardium tool recorded one more item here, which is that in this
procedure, a user (in this case, MIRZA_DBA) appeared, which erased the
user who committed the criminal activity.
Forensic analysis led to very important information indicating a valid
trace, i.e., the fact that the administrator (MIRZA_DBA) was actually
responsible for the criminal activity (Figure 92.).

aaa.bbb.ee.ff
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.gg.hh
aaa.bbb.ee.ff
aaa.bbb.ee.ff

aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj
aaa.bbb.ii.jj

Figure 92. Details of the report about the creation of the user POM_2015
and granted access rights

The forensic analysis presented in the previous report clearly shows when
the user was created and in what way, and how he obtained privileges over
the tables in order to access the database. In conclusion, we can notice that

124

�the account and tables were deleted in order to try to conceal the proof of
the criminal activity.

Case 5: Database forensics – data deletion
Company’s marketing department discovered that data from a database
was deleted and requested the investigation. Human resources also
discovered

that the column with monthly employees’ salaries in the

database table was deleted. Thus, they initiated data recovery from the
backup, however, before the procedure of restoring data from the backup,
management wanted to report who, what, when, and in what way deleted
data from the database.
The report generated using IBM Guardium for the table where the data
was deleted shows who deleted data, when and how that happened, and
which tool was used.

aaa.bbb.ee.ff

aaa.bbb.ii.jj

Figure 93. A forensic report related to deleted data in the table

As shown in the IBM Guardium report, the user who is responsible for
deleting all data from the table NOVE_USLUGE is TRON555.

Figure 94. Report on details of creation and assignment of privileges for
the user TRON555
125

�However, when the team tried to further explore the origin of the user, i.e.
when it was created and who created it in the IBM Guardium, they failed.
The forensic investigator realized that the attacker was well-acquainted
with the IBM Guardium system and managed to hide the trace of creating
and granting rights to the user who cleared all data in the table.
The following forensic analysis showed that the attacker knew that there
were users which were not recorded by the IBM Guardium when
monitoring changes in the database. These users began the service and
they were used to run backup scripts, which were excluded from
monitoring through the IBM Guardium tool which was permitted by the
management.

Figure 95. View exception rules for users who are not treated through
IBM Guardium

Figure 96. shows that the attacker might have used one of the two
mentioned users in order to circumvent the system and thereby attempt to
hide the true trail indicating who is responsible for an unauthorized action
of deleting data in the table. However, s/he did not consider that the
forensic investigator had other methods and tools which could lead to

126

�evidence. By inspecting the redo log file with the LogMiner tool, the
requested response indicated which user was behind the user TRON555.

Figure 96. LogMiner Detailed report for the creation and permitting
access for the TRON555 user

However, since this was the service user account, the forensic investigator
had to investigate further to see who enabled the user OPER to create and
assign rights to users in the database or delete data from the table. The
report received through the IBM Guardium gave the answer to this
question and at the same time the solution to another request that came
from the Human resources regarding deleted data containing salaries from
the NAKNADE_USER table.

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

aaa.bbb.gg.hh

aaa.bbb.ii.jj

Figure 97. Details of the report related to deleting a column in the table

127

�The report shows that the user OPER was created on the computer whose
IP address was aaa.bbb.gg.hh and on which the user MIRZAHAL has
been registered with the help of the SYS base user. The user OPER was
assigned rights to delete the column in the table.
The report from logMiner shows that the same user (OPER) was used to
create another user (TRON555) who deleted the data from the
NOVE_USLUGE table.
This test scenario is an indication that an attacker will always search for a
"weak point" of the systems, programs, equipment, or devices. Attackers
seek weak points in an attempt to hide themselves, thus avoiding any
possible liability for the committed crime.

Summary
Cyber security is a subset of the information security which deals with the
security of information stored in digital form and transferred over
communication links. A great part of information security related
standards deals with cyber security issues. Almost daily, media reports
reveal cyber security related incidents. After the historical analysis, we
can conclude that we will see an increase in incidents of this type,
especially as more services and users use digital technology in their
everyday work and life.

Knowledge acquired
Forensic data recovery of files on PC, forensic data recovery of Viber,
voice call, SMS, and Coco on an Android mobile phone. Database

128

�forensic related to user complaints on high bills, salaries data leakage, and
data deletion.

Review questions
1. How attacker can hide wrongdoings?
2. Location of database on mobile Android phone?

Further readings
-

Digital transformation: online guide to digital business
transformation https://www.i-scoop.eu/digital-transformation/

-

The Cyber Security Management System: A Conceptual Mapping,
SANS Institute InfoSec Reading Room
https://www.sans.org/reading-room/whitepapers/basics/cybersecurity-management-system-conceptual-mapping-591

Video resources
-

The case of the stolen exams

https://www.youtube.com/watch?v=1BVG6cmPlPk
-

Digital Forensics – Famous Cases

https://www.youtube.com/watch?v=gPuugbpLOeI

129

�130

�6. Conclusions

Chapter abstract
Chapter goals: To summarise book goals and review gained knowledge.
Cybercrime is much different from the conventional crime related to the
physical world. There are a lot of challenges for the law enforcement and
organisations who are victims of the cyber-crime. There is not much
difference between crimes in cyber and physical space, however, in cyber
space there is a lot more data and ways in which criminals could hide it.
Also, it is more challenging to perform the digital forensic investigation
because specific data can be found in volatile or non/volatile memory.
Another challenge is the fact that criminals do not have boundaries, while
boundaries between different countries’ jurisdictions exist.
Digital forensics is still in the process of development, and is constantly
being upgraded with the latest scientific advancements and new practices.
Technology progress must be followed by the goal to be ready to face new
challenges in form of crime techniques in the cyberspace.
Additional professional, legal, and scientific efforts have to be invested to
improve the existing practices to combat cyber criminals. It is a
professional duty to support activities and develop techniques and
infrastructures to fight against the misuse of cyber resources.

131

�This book presents the range of free digital forensic tools which can be
used by students as a guide to develop and practice their skills.
We presented several simulated cases of digital forensic investigations
with documented evidence, and steps which can be followed in similar
situations.
Furthermore, expert witnesses can present the evidence from real digital
forensic cases at the court by following steps and using tools presented in
this book, or similar procedures and tools accepted in local and
international jurisdiction.
Finally, the digital forensic investigator must continuously upgrade
knowledge about cases, tools, best practices, and technology. Technology
is developing very fast, so even some tools presented in this book might
already be outdated, which is why reading and lifelong learning is
important for a successful combat against the cyber-crime.

132

�Appendix – Consent Form

I, _______________________________(name and surname), (DOB
____/____/____),
hereby
authorizes
__________
____________________________________,
an
__________________________________________________ (function title),
to take custody and analyse the items detailed below for evidence. I understand
that copies of the contents of the items, including all files and data, may be
copied and retained for the analysis. I also understand that the analysis of the
copies of the media may continue even after the items designated for the
analysis are returned. I provide my consent to this analysis freely, willingly, and
voluntarily, and with the knowledge that I have the right to refuse to consent. I
provide my consent without fear, threat, coercion, or promise of any kind.
Device

Serial number

Additional owner/user
details

Owner’s printed name

Signature

Witness’ printed name

Signature

Witness’ printed name

Signature

133

�Appendix – Incident response form

General data about incident


System under attack



Incident investigation in progress



Incident closed

Required assistance:_________________________________________
Which data, service, project is under an impact:
__________________________________________________________
__________________________________________________________

Type of incident


Malicious software



DoS/DDoS attacks



Unauthorized access



Leakage of data and information in public

Date and time of the incident:
_____________________________________
Brief summary:
__________________________________________________________
__________________________________________________________
__________________________________________________________
134

�Details for malicious software:
Source (mail, web page, mobile memory such as USB):
____________________________________________________

Type: (virus, Trojan, worm, spyware, other):
__________________________________________________________
__________________________________________________________

DoS / DDoS attack
Attack source:
__________________________________________________________
Service attacked (OS version, IP address):
__________________________________________________________
Type of DoS / DDoS traffic:
__________________________________________________________

Details for an unauthorized access:
__________________________________________________________
__________________________________________________________

Leakage of data and information in public:
135

�__________________________________________________________
__________________________________________________________

Appendix – Digital forensic process

136

�137

�List of Figures

Figure 1. Word “Forensic” explanation (google, 2018) ......................................2
Figure 2. Digital and Computer forensic realm ...................................................6
Figure 3. Computer forensic................................................................................9
Figure 4. Network forensics ..............................................................................10
Figure 4. Forensic analysis goals to detect – who, what, when, where .............12
Figure 5. Incident response plan (Banking and Insurance, 2017) .....................13
Figure 6. Digital and Cyber forensic types........................................................18
Figure 7. Steps in the Digital Forensic Investigation Process ...........................28
Figure 8. Application analysis ...........................................................................35
Figure 9. Sample_file.txt content ......................................................................43
Figure 10. Creating concealed message in sample_file1.txt content .................44
Figure 11. Creating concealed message in sample_file1.txt content .................44
Figure 12. Reading concealed message in sample_file1.txt content .................45
Figure 13. File sizes comparison .......................................................................46
Figure 14. Hard disk docking station (Renkforce, 2019) ..................................50
Figure 15. Memory card docking station (Logilink, 2019) ...............................51
Figure 16. Portable Computer Forensic Lab Road MASSter 2, 2019 ...............52
Figure 17. Disk Genius......................................................................................53
Figure 18. Calculating Hash Value ...................................................................54
Figure 19. Q Capture program works with LogMiner to retrieve changed data
IBM Knowledge, Center, 2013 .........................................................................55
Figure 20. View all transactions for user, Nanda A., 2019 ..............................56
Figure 21. LogMiner results, Nanda A., 2019...................................................56
Figure 22. LogMiner results, Nanda A., 2019...................................................57
Figure 23. IBM Guardium (2019) Navigation Overview ..................................57
Figure 24. IBM Guardium (2019) Out of the box creation ...............................58
Figure 25. DB Browser for SQLite ...................................................................59
Figure 26. FTP connection ................................................................................61
Figure 27. Captured FTP connection with Wireshark .......................................61
Figure 28. NIKSUN NetDetector, 2019 ............................................................62
Figure 29. Xplico (2019) ...................................................................................63
Figure 30. Kingo Android Root ........................................................................64
Figure 31. Santoku Linux ..................................................................................65
Figure 32. Santoku Linux Download ................................................................65
Figure 33. AFLogical OSE................................................................................67
Figure 34. Autopsy Main Operations Screen ....................................................68
138

�Figure 35. Type of Data Source ........................................................................69
Figure 36. Autopsy Ingest Module ....................................................................71
Figure 37. Android Analyzer.............................................................................72
Figure 38. Access to Imaged Partitions .............................................................73
Figure 39. Timeline – View Counts ..................................................................74
Figure 40. Filter Events Categories ...................................................................75
Figure 41. Timeline - View Details ...................................................................75
Figure 42. Report Formats ................................................................................76
Figure 43. Report - Case Summary ...................................................................77
Figure 44. Report - Tagged Images ...................................................................77
Figure 45. Disk Genius access to the investigated hard disk ............................82
Figure 46. Disk Genius data copy .....................................................................83
Figure 47. ADB Driver Verified; Android Device Connected..........................87
Figure 48. Android Device Connected ..............................................................87
Figure 49. Successful Communication to Mobile Device over ADB ...............88
Figure 50. Lenovo Rooting Start .......................................................................89
Figure 51. Device Status During Rooting Process ............................................90
Figure 52. Lenovo Moto Smart Assistant Device Status ..................................91
Figure 53. Sideloading BusyBox Over ADB ....................................................92
Figure 54. Starting Busybox..............................................................................92
Figure 55. Testing Busybox Tool Sha1sum ......................................................93
Figure 56. Android Block Names......................................................................94
Figure 57. Android Partition Names and Blocks...............................................95
Figure 58. Starting AFLogical OSE acquisition................................................96
Figure 59. Device Capture Options ...................................................................96
Figure 60. AFLogical OSE Data Extraction and Transfer ................................97
Figure 61. Acquired Data in Remote Folder .....................................................97
Figure 62. An integrity of the evidence image file ............................................99
Figure 63. Calculating Hash Value of the Evidence Image ............................100
Figure 64. Files Containing Acquired Data.....................................................101
Figure 65. Content of SMS File ......................................................................101
Figure 66. Content of CallLog Calls File ........................................................101
Figure 67. Autopsy Mounted Partition from the Evidence Image ..................103
Figure 68. Viber Database Location and Metadata .........................................104
Figure 69. Viber Database Structure ...............................................................105
Figure 70. Retrieve Data About Message from Table Messages ....................106
Figure 71. Retrieve Data About Calls from Table Messages_Calls ................107
Figure 72. Viber Database Records .................................................................107
Figure 73. Recovered Deleted Picture .............................................................108
Figure 74. Recovered Deleted Picture Metadata .............................................109
Figure 75. MMSSMS Database Structure .......................................................110
Figure 76. Retrieve Data about Calls from Table SMS...................................111
Figure 77. Recovered Deleted Database Record .............................................112
Figure 78. Contact2 Database Structure ..........................................................113
139

�Figure 79. Retrieve Data About Calls from Table Calls .................................114
Figure 80. 59317329_coco Database Structure ...............................................115
Figure 81. Retrieve Data about Chat Message from Table Content ................116
Figure 82. Recovered Evidence Message from Deleted Database Record .....116
Figure 83. Retrieve Data about the Message from Table Content ..................117
Figure 84. Report Summary ............................................................................119
Figure 85. Report of the Evidence Tagged Files and Locations .....................119
Figure 86. IBM Guradium report for the customer complaints.......................120
Figure 87. IP resolution ...................................................................................121
Figure 88. Report from IBM Guardium shows ATTACKER creator .............121
Figure 89. IP address, username, and SQL command .....................................122
Figure 90. IP Address name resolution ...........................................................123
Figure 91. View detailed POM_2015 user-related activities ..........................123
Figure 92. Details of the report about the creation of the user POM_2015 and
granted access rights........................................................................................124
Figure 93. A forensic report related to deleted data in the table .....................125
Figure 94. Report on details of creation and assignment of privileges for the user
TRON555 ........................................................................................................125
Figure 95. View exception rules for users who are not treated through IBM
Guardium.........................................................................................................126
Figure 96. LogMiner Detailed report for the creation and permitting access for
the TRON555 user ..........................................................................................127
Figure 97. Details of the report related to deleting a column in the table .......127

140

�List of Tables

TABLE 1. Audit vs. Digital forensic investigation .................................................. 7
TABLE 2. Reporting Person 1 Data ......................................................................... 85
TABLE 3. Reporting Person 2 Data ......................................................................... 85
TABLE 4. Overview of Logically Acquired Data for Reporting Person 1 ........ 102
TABLE 5. Overview of Logically Acquired Data for Reporting Person 2 ........ 102
TABLE 6. Collected Data about Applications in Investigation Scope .............. 103
TABLE 7. Viber Message and Call Investigation ................................................. 104
TABLE 8. SMS Message Investigation .................................................................. 109
TABLE 9. GSM Voice Call Investigation............................................................... 112
TABLE 10. Coco Message Investigation ............................................................... 114
TABLE 11. Quantitative Data about Found Evidence ....................................... 117

141

�142

�Acronyms

ACK Acknowledgement
CERT Centre for Emergency Report Team
CISA Certified Information Security Auditor
CISM Information Security Manager
CISP Certified Information Security Professional
CISO Chief Information Security Officer
CISWG Corporate Information Security Workgroup
CSO Chief Security Officer
DMZ Demilitarised zone
DoS

Denial of Service

DDoS Distributed Denial of Service
DML Data Manipulation Language
FTP

File Transfer Protocol

HTTP Hyper Text Transfer Protocol
IA

Internal Auditor

ICMP Internet Control Message Protocol
IDS

Intrusion Detection System

IP

Internet Protocol

IPS

Intrusion Prevention System

IEC

International Electrotechnical Commission

IEEE Institute of Electrical and Electronic Engineers
IPX

Internetwork Packet Exchange

ISACA Information Systems Audit and Control Association
143

�ISM Information Security Manager
ISMS Information Security Management System
ISO

International Standardisation Organisation

ISSEA International Systems Security Engineering Association
IT

Information Technology

KPI

Key Performance Indicator

LAN Local Area Network
MIB

Management Information Base

NIST National Institute of Standards &amp; Technology
NMS Network Management Station
OID

Object identifier

OSI

Open System for Interconnection

PDCA Plan Do Check Act
QoS

Quality of Service

SMTP Simple Mail Transfer Protocol
SNMP Simple Network Management Protocol
SQL

Simple query language

SYN Synchronize
TCP

Transmission Control Protocol

UDP User Datagram Protocol
UPS

Uninterruptable Power Supplies

VPN Virtual Private Network
WAN Wide Area Network

144

�References

AccessData. (2006). White paper: MD5 collision – The effect on
Computer

Forensics.

Available

from:

https://ad-

pdf.s3.amazonaws.com/papers/wp.MD5_Collisions.en_us.pdf
Afonin, O. &amp; Gubanov, Y. (2013, May 28). Catching the Ghost: How to
Discover Ephemeral Evidence through Live RAM Analysis. Forensic
magazine.

Available

from:

http://www.forensicmag.com/article/2013/05/catching-ghost-howdiscover-ephemeral-evidence-through-live-ram-analysis
Appazov, A. (2014). Legal Aspects of Cybersecurity. Faculty of Law
University

of

Copenhagen.

Retrieve

from:

http://justitsministeriet.dk/sites/default/files/media/Arbejdsomraader/For
skning/Forskningspuljen/Legal_Aspects_of_Cybersecurity.pdf
Android.

(2017),

Application

Security,

Available

https://source.android.com/security/overview/app-security

from
accessed

25.9.2017
Android.

(2017),

Platform

Architecture,

Available

https://developer.android.com/guide/platform/index.html#art

from
accessed

23.12.2017

145

�Ayers, R. Brothers, S and Jansen, W. (2014), Guidelines on Mobile Device
Forensics, NIST Special Publication 800-101: Available from
http://dx.doi.org/10.6028/NIST.SP.800-101r1, 20.12.2017 [Accessed on
12.01.2019]
Banking and Insurance, 2017 Available from: http://en.finance.siapartners.com/20171211/cyber-incident-response-how-strong-yourincident-response-plan, [Accessed on 20.01.2019]
Boccaccini, M.T. (2002). What Do We Really Know about Witness
Preparation? Behav. Sci. Law 20: 161–189. DOI: 10.1002/bsl.472
Burnette, Michael W. “Forensic Examination of a RIM (BlackBerry)
Wireless Device.” June 2002. Available from: http://www.rhlaw.com/ediscovery/Blackberry.pdf (accessed 11.1. 2018)
Catts E.P. &amp; Goff M.L. (1992). Forensic entomology in criminal
investigations. Annu
Rev
Entomol. Vol.37:253-272.
DOI:
10.1146/annurev.en.37.010192.001345
Carrier, B. and Spafford, E. (2004). An Event-Based Digital Forensic
Investigation Framework, The Digital Forensic Research Conference, p23. Available from:
https://www.dfrws.org/sites/default/files/session-files/paper-an_eventbased_digital_forensic_investigation_framework.pdf

[Accessed

on

20.01.2019]
Casey, E. (2011). Digital Evidence and Computer Crime: Forensic
Science, Computers
146

�and

the

Internet

(3rd

ed.).

Elsevier

Inc.

Available

from:

http://booksite.elsevier.com/samplechapters/9780123742681/Front_Matt
er.pdf 309 [Accessed on 11.02.2019]
Cellebrite (2017), Cellebrite's Universal Forensic Extraction Device
(UFED), Available from https://www.cellebrite.com/en/home/ (accessed
21.1.2018)
Cosic, J., Cosic, Z., &amp; Baca, M. (2011). An ontological approach to study
and manage digital chain of custody of digital Evidence, Journal of
Information and Organizational Sciences, 35 (1): 1-13
Chow, K.P. &amp; Shenoi S. (2010, January), Advances in Digital Forensics
VI. Sixth IFIP WG 11.9 International Conference on Digital Forensics.
Cho, W. K. T., &amp; Gaines, B. J. (2007). Breaking the (Benford) Law:
Statistical Fraud Detection in Campaign Finance. The American
Statistician, 61(3), 218­223.
Criminal

Justice

Degree

Schools

(2019),

Available

at:

https://www.criminaljusticedegreeschools.com/criminal-justicedegrees/computer-forensic-degree/ [Accessed on 20.02.2019]

Crime

Museum,

2019

Edmond

Locard,

Available

at:

https://www.crimemuseum.org/crime-library/forensicinvestigation/edmond-locard/ [Accessed on 20.02.2019]
Data, Merriam-Webster 2019 Available at:

https://www.merriam-

webster.com/dictionary/data [Accessed on 02.07.2019]

147

�Desertcart. (2018), Palm V Hand held PDA, Available from
https://www.desertcart.ae/products/15557437-palm-v-hand-held-pda htm
[Accessed on 20.01.2019]
Diekmann, A. (2012), Making Use of "Benford's Law" for the
Randomized Response Technique, Article in Sociological Methods &amp;
Research,

DOI:

10.1177/0049124112452525

Available

from

https://www.researchgate.net/profile/Andreas_Diekmann2/publication/2
69815391_Making_Use_of_Benford%27s_Law_for_the_Randomized_
Response_Technique/links/553bae070cf245bdd766705f.pdf

[Accessed

on 20.01.2019]
(DFRWS, 2001), A Road Map for Digital Forensic Research Available
from:

http://dfrws.org/sites/default/files/session-

files/a_road_map_for_digital_forensic_research.pdf

[Accessed

on

02.02.2019]
Edson, J. (2011, July 25). A Brief History of Forensic Science. Australia’s
Science Channel. Available from: http://riaus.org.au/articles/a-briefhistory-of-forensic-science/ [Accessed on 20.12.2018]
Forensic, Merriam Webster, 2018, Available at: https://www.merriamwebster.com/dictionary/forensic, [Accessed on 20.12.2018]
Forensics

and

Benford’s

Law.

(2018),

Event

https://eventlogxp.com/blog/forensics-and-benfords-law/
20.1.2018

148

Log

Explorer,
accessed

�Gadgeter (2018), RIM BlackBerry 950 Review, Available from https://thegadgeteer.com/2001/02/26/rim_blackberry_950_review/

accessed

10.1.2018
Google,

2018,

Etymology

of

word

Forensic,

Available

at:

https://www.google.ba/search?rlz=1C1AVNC_enBA595BA595&amp;q=fore
nsic+etymology&amp;spell=1&amp;sa=X&amp;ved=0ahUKEwi9offs6qPeAhVECyw
KHaDMCM8QBQgnKAA&amp;biw=1366&amp;bih=657

[Accessed

on

26.10.2018]
Grand, J. (2002) pdd: Memory Imaging and Forensic Analysis of Palm
OS
Devices,
https://www.researchgate.net/publication/2490864_pdd_Memory_Imagi
ng_and_Forensic_Analysis_of_Palm_OS_Devices (accessed 20.1.2018)
History of Fingerprints, (2018) Crime Scene Forensic, LLC, Available at:
http://www.crimescene-forensic.com/History_of_Fingerprints.html
[Accessed on 01.11.2018]
IBM Guardium, (2019) IBM Guardium Data Protection for Databases,
Available at: https://www.ibm.com/us-en/marketplace/ibm-guardiumdata-protection [Accessed on 01.11.2018]
IBM Knowledge Center, 2013 How a Q Capture program works with the
Oracle LogMiner utilit, Available at:
https://www.ibm.com/support/knowledgecenter/SSTRGZ_10.2.0/com.ib
m.swg.im.iis.repl.qrepl.doc/topics/iiyrqcapclogminercnc_ep.html
[Accessed on 15.11.2018]

149

�IDC. (2017), Smartphone OS Market Share, 2017 Q1, Available at:
https://www.idc.com/promo/smartphone-market-share/os

accessed

5.12.2017
IIA, 2019, Institute of Internal Auditors, 2019, Definition of Internal
Auditing,

2019,

Available

at:

https://na.theiia.org/standards-

guidance/mandatory-guidance/pages/definition-of-internal-auditing.aspx
[Accessed on 20.01.2019]
IOCE. (1999). IOCE Principe &amp; Definitions.
Available from: https://archives.fbi.gov/archives/about-us/lab/forensicscience-communications/fsc/april2000/swgde.htm

[Accessed

on

20.01.2019]
Information,

Merriam-Webster

2019,

Available

from:

https://www.merriam-webster.com/dictionary/information [Accessed on
20.05.2019]
Information system, Britanica, 2019, Information system, an integrated
set of components for collecting, storing, and processing data and for
providing information, knowledge, and digital products, 2019 Available
from: https://www.britannica.com/topic/information-system [Accessed
on 20.01.2019]
Information technology, Merriam-Webster, 2018, Available from:
https://www.merriam-

150

�webster.com/dictionary/information%20technology,

[Accessed

on

20.01.2018]
Infosec

Institute.

(2017),

Computer

Forensics

Salary

Data,

http://resources.infosecinstitute.com/category/computerforensics/introdu
ction/computer-forensics-salary-data/#gref accessed 19.12.2017
Kaur, R. &amp; Kaur, A. (2012). Digital Forensics. International Journal of
Computer

Application

(0975-8887),

50(5),

2-4.

Available

at:

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.258.7882&amp;rep
=rep1&amp;type=pdf [Accessed on 20.01.2019]
International

Telecommunication

Union.

(2014).

Understanding

cybercrime: phenomena, challenges and legal response. Report.
Available

from:

http://www.itu.int/en/ITU-

D/Cybersecurity/Documents/cybercrime2014.pdf

[Accessed

on

20.01.2019]
Kremic E.; Subasi A.; Hajdarevic K., Face recognition implementation for
client server mobile application using PCA, Proceedings of the ITI 2012
34th International Conference on Information Technology Interfaces,
Year: 2012 Page s: 435 – 440
Law

Enforcement

Cyber

Center

(2017),

Available

http://www.iacpcybercenter.org/officers/digital-evidence/

at:

accessed

15.12.2017

151

�Lee, K. Lee, Y. Lee, H. and Yim, K. (2016), A Brief Review on JTAG
Security, 2016 10th International Conference on Innovative Mobile and
Internet Services in Ubiquitous Computing DOI 10.1109/IMIS.2016.102
Levin. J, (2015), Android Internals: Power User's View (1st edition),
Cambridge: Technologeeks.com
Litchfield D., Oracle Forensic Part 1: Dissecting the Redo Logs, An
NGSSoftware Insight Security Research (NISR) Publication ©2007 Next
Generation Security Software Ltd.
Logilink,2019,

Available

at:

http://www.logilink.eu/media/images/produkt/_800/CR0012.png
[Accessed on 20.11.2018]
Lynch, V.A. &amp; Duval J.B. (2011). Forensic Nursing Science (2nd ed.).
Elsevier Mosby p2
Marcella A. J.

and Menendez D. Cyber Forensic, Second Edition,

Auerbach Publication, 2008
Massachusetts Digital Evidence Consortium, 2015, Digital Evidence
Guide for First Responders,
Available

from:

http://www.iacpcybercenter.org/wp-

content/uploads/2015/04/digitalevidence-booklet-051215.pdf [Accessed
on 20.11.2018]

152

�Nanda A., 2019 Transaction Management with LogMiner and Flashback
Data Archive, Available from: http://www.oracle.com/us/solutions/11gtransactionmanagement-092065.html [Accessed on 20.11.2018]
Nanda A. and Burleson D.K., Oracle Privacy Security Auditing, Rampant
Techpress, 2003
National Institute of Justice. (2004). Forensic Examination of Digital
Evidence:

A

Guide

for

Law

Enforcement.

Available

from:

https://www.ncjrs.gov/pdffiles1/nij/199408.pdf
Nelson, B., Phillips A., &amp; Steuart C. (2015). Guide to Computer Forensics
and Investigations (5th ed.). Course Technology. Available from:
https://books.google.ba/books?id=PUh9AwAAQBAJ&amp;pg=PA137&amp;dq=
what+is+digital+evidence+SWGDE&amp;hl=en&amp;sa=X&amp;ved=0ahUKEwii87
LhrqnRAhUCVhQKHTsIAb4Q6AEIMTAB#v=onepage&amp;q&amp;f=false
NIST. (2004). Digital Data Acquisition Tool Specification, Public Review
of Version 4.0. Available from: http://www.cftt.nist.gov/Pub-Draft-1DDA-Require.pdf
NIKSUN NetDetector, 2018 Available at:
https://www.phoenixdatacom.com/product/niksun-netdetector-packetcapture-network-security-forensics/ [Accessed on 20.12.2018]
Open University, 2018, Different types of digital forensic, Available at:
https://www.open.edu/openlearn/science-maths-technology/digitalforensic/content-section-4.3, [Accessed on 20.12.2018]
153

�(Oracle, pp. 79) Database Administrator's Guide, Available at:
https://docs.oracle.com/cd/B28359_01/server.111/b28310/onlineredo001
.htm#ADMIN11302 [Accessed on 15.02.2019]
Oracle

Fine

Grained

Auditing,

Available

at:

https://www.oracle.com/technetwork/database/security/index083815.html2019 [Accessed on 18.02.2019]
Oracle

DBA_FGA_AUDIT_TRAIL

Available

at:

https://docs.oracle.com/cd/B19306_01/server.102/b14237/statviews_311
5.htm#REFRN23075 [Accessed on 18.02.2019]
Oracle

LogMiner,

2019,

Available

at:

https://www.oracle.com/technetwork/database/features/availability/logm
ineroverview-088844.html, [Accessed on 25.03.2019]
Pollit, M. (2017, January 15). A history of digital forensics. Available
from:
https://pdfs.semanticscholar.org/0d15/132439fc1de82724dd06effff5a782
eefeac.pdf
Recombu.

(2017),

Android

updates,

Available

from

https://recombu.com/mobile/article/what-is-android-and-what-is-anandroid-phone_M12615.html , accessed 25.09.2017
Renkforce, 2019 Available at: https://www.conrad.com/p/renkforce-rfdocking-06-usb-30-esata-sata-4-ports-hdd-docking-station-1305502
[Accessed on, 14.03.2019]
154

�Road

MASSter

2,

2019

Available

at:

http://dfrt.blogspot.com/2007/01/forensic-tools-hardware.html [Accessed
on, 01.03.2019]
Roy, NR. Khanna, AK. Aneja, L (2016), Android Phone Forensic: Tools
and Techniques International Conference on Computing, Communication
and

Automation

(ICCCA2016)

Available

from

http://ieeexplore.ieee.org/document/7813792/
Ryder, K. (2002). Computer Forensics – We’ve Had an Incident, Who Do
We Get to Investigate? SANS Institute InfoSec Reading Room. Available
from:

https://www.sans.org/reading-

room/whitepapers/incident/computer-forensics-weve-incidentinvestigate-652
ShareTechnote.

(2017),

Android

ADB,

Available

from

http://www.sharetechnote.com/html/Android/Android_ADB.html
accessed 25.9.2017
Sapir, G.I. (2007, January 2). Qualifying the Expert Witness: A Practical
Voir
Dire.
Forensic
magazine.
Available
from:
http://www.forensicmag.com/article/2007/01/qualifying-expert-witnesspractical-voir-dire

Singh, N and, Bansal, R. (2015), Analysis of Benford’s Law in Digital
Image Forensics, Signal Processing and Communication (ICSC), 2015
International Conference
155

�Sophos. (2018), 2018 Malware Forecast: ransomware hits hard,
continues to evolve, Available from https://news.sophos.com/enus/2017/11/02/2018-malware-forecast-ransomware-hits-hard-crossesplatforms/ accessed 6.1.2018
Smith, W. (1867). Dictionary of Greek and Roman Biography and
Mythology Vol 1. Boston: Little Brown and Company p209

SNOW,

2019,

The

SNOW

Home

Page,

Available

at:

http://www.darkside.com.au/snow/ [Accessed on, 14.03.2019]
Startribune. (2018), Minnesota detectives crack the case with digital
forensics, Available from http://www.startribune.com/when-teens-wentmissing-digital-forensics-cracked-case/278132541/ accessed 10.1.2018
SWGDE, (2013) Best Practices for Computer Forensic, Scientific
Working Group on Digital Evidence, Version: 3.0 (September 14, 2013)
Available

at:

https://www.swgde.org/documents/Archived%20Documents/SWGDE%
20Best%20Practices%20for%20Computer%20Forensic%20v3-0,
[Accessed on, 29.10.2018]
UNODC. (2013). Comprehensive Study on Cybercrime. Available from:
https://www.unodc.org/documents/organizedcrime/UNODC_CCPCJ_EG.4_2013/CYBERCRIME_STUDY_210213.
pdf
156

�UNODC. (2013). Comprehensive Study on Cybercrime. (V.13-80699)
Vienna: United nations office on drugs and crime

UN. (2000). Tenth United Nations Congress on the Prevention of Crime
and the Treatment of Offenders. (A/CONF.187/10). Available from:
https://www.asc41.com/UN_Congress/10th%20UN%20Congress%20on
%20the%20Prevention%20of%20Crime/013%20ACONF.187.10%20Cr
imes%20Related%20to%20Computer%20Networks.pdf
Vandeven, S. (2014). Forensic Images: For Your Viewing Pleasure. SANS
Institute

InfoSec

Reading

Room.

Available

from:

https://www.sans.org/reading-room/whitepapers/forensics/forensicimages-viewing-pleasure-35447 [Accessed on, 15.01.2019]
Xplico

(2019)

Available

at:

http://www.xplico.org/wp-

content/uploads/2008/11/xwi_email.png [Accessed on, 29.01.2019]
Whitecomb, C.M. (2002). An Historical Perspective of Digital Evidence:
A Forensic Scientist’s View. International Journal of Digital Evidence
1(1),1-3
Watson, D.A., Jones, A. (2013). Digital Forensics Processing and
Procedures: Meeting the Requirements of ISO 17020, ISO 17025, ISO
27001 and Best Practice Requirements (1st ed.). London: Elsevier /
Syngress.
Wiley Carol, What Is the Difference Between Computer Forensic &amp;
Digital Forensic? Available at: https://careertrend.com/facts-6733855157

�difference-computer-forensic-digital-forensic-.html

[Accessed

on,

29.01.2019]
Williams A., Leaving a trace: Forensic science through history, BBC,
Available

at:

https://www.bbc.com/timelines/zcq2xnb#zgsg4wx,

[Accessed on, 29.10.2018]
Witte

de

With,

2019

https://www.wdw.nl/en/participants/rodolphe_archibald_reiss [Accessed
on, 29.10.2018]
Wright, Paul M. “Oracle forensic” Oracle security best practice, Rampant
Techpress; May 2007.
Yeatts, T. (2001) Forensics: Solving the Crime, Available from:
http://connection.ebscohost.com/c/articles/15721149/chapter-one-jamesmarsh-toxicology

158

�Index

A
Access control
Active attack
Administrator and operator logs
Applications
Architecture
Artificial
Assessment
Asset
Attacker
Audit
Audit logging
Authenticity
Availability

B
Business Continuity
Business continuity and risk
assessment
Business continuity management
Business continuity planning
framework

C
Change control procedures
Change management
Clock synchronization
COBIT
Communication
Communications and operations
management
Compliance
Computer
Confidentiality
Continuity
Control of internal processing
Control of operational software
Control of technical vulnerabilities
Controls against malicious code

Controls against mobile code
Countermeasure
Crypto

D
Denial of service
Developing and implementing BCP
including information security
Disaster
DMZ
Distance vector

E
Electronic
Electronic messaging
Electronic commerce
Equipment identification in the
network
Encryption
Escalation

F
Fault
Fault logging
Firewall
Forensic
FTP

G
Gap analysis
Goal, Goals

H
Hardware
Human
Human resources
HRA
HTTP

159

�I

O

Incident
Including information security in the
BCM process
Information access restriction
Information Backup
Information security
Information security incident
management
Information systems acquisition,
development and maintenance
Infrastructure
Input data validation
Integrity
Interruption
Intrusion detection
IP address
IPX
ISMS
ISO 27000
ITIL

OID
On-line transactions
Output data validation

P

K

Passive attack
Password management system
Performance
Physical and environmental security
Policy on the use of cryptographic
controls
Policy on use of network services
Privilege management
PRA
Proactive
Procedure
Protection of information systems
audit tools
Protection of log information
Protection of system test data
Protocol
Publicly available systems

Key management
KPI

Q

L
Limitation of connection time
Local area networks

M
MAC address
Management
Media
Message integrity
Metric,
Monitoring system use

N
Network
Network controls
Network connection control
Network layer
Network routing control
NMS
Non-Reputability

160

QoS
Quality
Qualitative
Quantitative

R
Recovery
Regulation of cryptographic controls
Regulatory
Remote diagnostic and configuration
port protection
Responsibilities and procedures
Restrictions on changes to software
packages
Review of user access rights
Risk
Risk management
Router
RTGS

S
SABSA
Secure disposal

�Secure log-on procedures
Security
Security of network services
Security of system documentation
Security requirements analysis and
specification
Segregation in networks
Separation of development, test and
operational facilities
Server
Session time-out
SMTP
SNMP
Software
Spyware
SQL
Switch
SYN
System acceptance

VPN
Vulnerability

W
WAN
Web
Wide area networks
Wireless
Worm

X
XML

T
TCP / IP
Technical compliance checking
Technical review of applications
after operating system changes
Terminal
Testing, maintaining and reassessing business continuity plan
Threat
Trojan

U
UDP
Unicast
UPS
Use of system utilities
User authentication for external
connections
User identification and
authentication
User password management
User registration
Utilities

V
Virus
Virtual Private Network,
Visualisation

161

�162

�About authors
Kemal Hajdarevic PhD, received B.Sc. from the Faculty of Electrical
Engineering, University of Sarajevo, Bosnia and Herzegovina, M.Sc. and
PhD from Leeds Metropolitan University/Leeds Beckett University, Leeds,
UK. He is currently working at the Central Bank of Bosnia and Herzegovina
as a Senior Internal Auditor for information Security and IT projects, and he
has a teaching position at the Faculty of Electrical Engineering, University of
Sarajevo.
Nermin Ziga MSc, received MSc from International Burch University.
Nermin is an employee of Raiffeisen Bank, were he works as an Information
Security Officer within Raiffeisen Bank’s Security Department.
Mirza Halilovic MSc, received MSc and BSc from the Faculty of Electrical
Engineering, University of Sarajevo. Mirza is the Head of IT department for
monitoring, security, and data protection at BH Telecom d.d. Sarajevo.

163

�164

�Dr. Hamid Jahankhani: The area of “Digital Forensics” and its challenges, is clearly one
of the key issues facing both the scientific community, industries and other users alike.
Clearly understanding the digital forensics in a step by step format would help the
practitioners in this fast paced technology development era. I welcome this new book on
"Digital Forensics Essentials" which also aims to address some of the emerging issues.
Looking at the table of content there are clearly a number of interesting areas of research and
hence this book will undoubtedly help researchers and practitioners alike. To my opinion the
scope and coverage of this book adequately represent a balanced review of the digital forensics
subject. I feel the primary audience for this book would be Researchers, Practitioners, PhD
and Postgraduate students.
I highly recommend this book.
Dr. Jasmin Azemovic: We are facing turbulent events in cyberspace, and digital forensics
is on of dominant research topics which is continuously being updated with the latest
scientific advancements. Innovations in digital revolution are evident and this book will help
to face new challenges in digital era with goal to fight against crime in the cyberspace and
committed with, and against digital infrastructures.
Dr. Colin Pattinson: History has shown that, whenever a powerful new technology is
developed, the desire to misuse that power soon follows. The field of computer network
technology is no exception. Indeed IT misuse, whatever the underlying motivation, must be
one of most frequent forms of unwanted activity there is.
The ability to determine that an event has taken place, to learn from it and - hopefully - to
prevent it occurring again is a prime motivation for a forensic analysis. Understanding of
any losses have occurred, and building a legally sustainable case against the perpetrators
requires even higher levels of information gathering and retention. It is therefore important
that the skills and knowledge necessary to conduct such analysis are available to
organisations when needed.
This book provides a grounding in the tools and techniques necessary to investigate a range
of attacks, showing the importance of a structured, logical and methodical approach.
It is recommended for graduate students and those specialising in IT forensics.
1

�</text>
                  </elementText>
                </elementTextContainer>
              </element>
            </elementContainer>
          </elementSet>
        </elementSetContainer>
      </file>
    </fileContainer>
    <collection collectionId="4">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26318">
                  <text>BOOKS</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26311">
                <text>Essentials of Digital Forensics </text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26312">
                <text>Kemal Hajdarević, Nermin Ziga and Mirza Halilovic</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26313">
                <text>Information available on Internet Live Stats web site&#13;
(www.internetlivestats.com) that 40 percent of world’s population is&#13;
using Internet Media almost daily reports on different cyber and digital&#13;
security incidents. Many more similar incidents have never been reported&#13;
or they have been reported years after they had occurred due to the fact&#13;
that they could have jeopardised ongoing law enforcement investigations&#13;
or because they could have been embarrassing and thus negatively affect&#13;
reputation of the victim – organisation or a person.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26314">
                <text>digital forensics</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26315">
                <text>International Burch University</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="40">
            <name>Date</name>
            <description>A point or period of time associated with an event in the lifecycle of the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26316">
                <text>July, 2019</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3446" public="1" featured="1">
    <fileContainer>
      <file fileId="4248">
        <src>https://omeka.ibu.edu.ba/files/original/b35517f5a6b3b1b5430cbd3beb65665b.docx</src>
        <authentication>3d88bd71483956752150c8fc8f575fa4</authentication>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26305">
                <text>THE RETURN OF THE MODERN&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26306">
                <text>LUKA KORLAET </text>
              </elementText>
              <elementText elementTextId="26307">
                <text>SVEBOR ANDREJEVIC</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26308">
                <text>The second half of nineties and twenties were the building Eldorado: the bank loans were shared without any delays to entrepreneurs and apartment buyers. It was built a lot and very uncertainly. Speaking of Zagreb, the slopes of Medvenice were especial ly attractive. That space was built with numerous urban vil las, the smal l multi-storey buildings, which its investors and architects were led by exploation logic and rarely think about quality of floor plans or about environment between buildings.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26309">
                <text>International Burch University</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26310">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3445" public="1" featured="1">
    <fileContainer>
      <file fileId="4247">
        <src>https://omeka.ibu.edu.ba/files/original/d98c62dc674b330134eb6d0d5eec8c89.docx</src>
        <authentication>b7b81d52bcb066ce1b2d555ea001830b</authentication>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26297">
                <text>Communication in Smart Homes with Emphasis wn Power Line Communication&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26298">
                <text>Ina Salihović</text>
              </elementText>
              <elementText elementTextId="26299">
                <text>Esma Musić</text>
              </elementText>
              <elementText elementTextId="26300">
                <text>Dejan Jokić</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26301">
                <text>Abstract: Power Line Communication (PLC) is a technology that allows consumers to use the already existing wiring infrastructure to exchange information. This paper overviews narrowband PLC in home automation, starting from the basics of power line communication and its advantages compared to wired and Wi-Fi automation systems, data modulation techniques, noise problems, frequency bands, all the way to regulations affecting PLC. The paper is finished off with an overview of three System on Chip (SoC) power line modems from a few different generations, Yitran’s IT800D from 2005, ON Semiconductor’s NCN49597 from 2012, and STMicroelectronics’s ST8500 from 2017.&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26302">
                <text>Keywords: Power Line Communication, narrowband PLC, PLC modem, System on Chip&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26303">
                <text>International Burch University</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26304">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3444" public="1" featured="1">
    <fileContainer>
      <file fileId="4246">
        <src>https://omeka.ibu.edu.ba/files/original/ced592d00fbb324ea8ec5b8c3f4b59bf.doc</src>
        <authentication>2fed9cf32cceb951fd4ffd11d4bc1b4e</authentication>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26289">
                <text>Review of Traffic Data Analysis for Accident Management&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26290">
                <text>Samir Kurtanovic</text>
              </elementText>
              <elementText elementTextId="26291">
                <text>Nejdet Dogru</text>
              </elementText>
              <elementText elementTextId="26292">
                <text>Zerina Masetic </text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26293">
                <text>Abstract - In the past decade transportation systems have been augmented with information and communication technology to provide innovative services to the participants in the traffic. This synergy has resulted in safer and more optimized transportation network. In past few decades, newly developed systems in the area of transportation have been collectively called the Intelligent Transportation Systems (ITS). ITS can be defined as a holistic, control and information and communication upgrade of the classical traffic and transportation system that achieves significantly improved performance, traffic flow, more efficient passenger and goods transport, improved traffic safety, comfort and passenger protection, and reduction of environmental pollution. The interest in ITS comes from problems caused by traffic jams, traffic accidents, environmental concerns, congestions, delays and the synergy of latest information technology for simulation, real-time control and communication networks. Traffic accident management is one of the main focus fields of ITS due to the severe consequence that the accidents have. This paper surveys the traffic accident relation studies in ITS.&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26294">
                <text>Keywords: Intelligent Transportation System, Accident Detection, Traffic Data Analysis, Traffic Flow Modelling, Traffic Monitoring&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26295">
                <text>International Burch University</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26296">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
  <item itemId="3443" public="1" featured="1">
    <fileContainer>
      <file fileId="4245">
        <src>https://omeka.ibu.edu.ba/files/original/2dca31036e8f1444f307dfc65d47b3b8.docx</src>
        <authentication>44ed7242bd6e58db28a43c8ae6198ec6</authentication>
      </file>
    </fileContainer>
    <collection collectionId="3">
      <elementSetContainer>
        <elementSet elementSetId="1">
          <name>Dublin Core</name>
          <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
          <elementContainer>
            <element elementId="50">
              <name>Title</name>
              <description>A name given to the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26245">
                  <text>Journal of Natural Sciences and Engineering</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="43">
              <name>Identifier</name>
              <description>An unambiguous reference to the resource within a given context</description>
              <elementTextContainer>
                <elementText elementTextId="26605">
                  <text>2637-2835</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="98">
              <name>DOI</name>
              <description>Digital object identifier</description>
              <elementTextContainer>
                <elementText elementTextId="26606">
                  <text>10.14706</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="45">
              <name>Publisher</name>
              <description>An entity responsible for making the resource available</description>
              <elementTextContainer>
                <elementText elementTextId="26607">
                  <text>International Burch University</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="41">
              <name>Description</name>
              <description>An account of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26608">
                  <text>Journal of Natural Sciences and Engineering (JONSAE) is a peer-reviewed, biannually published international journal focusing on empirical and theoretical research in all branches of Engineering and Natural Sciences. It is published on the behalf of Faculty of Engineering and Natural Sciences of International Burch University and aims to provide the best content regarding by publishing original research papers, review articles, special issues, feature articles, and book reviews. All manuscript submissions are subject to initial appraisal by the Editor, and, if found suitable for further consideration, to peer review by independent, anonymous referees. All peer review is double-blind and submission is online. The journal welcomes theoretical, applied, interdisciplinary and methodological work, with preference on empirical research, critical approach and problem-solving methods in manuscripts.</text>
                </elementText>
              </elementTextContainer>
            </element>
            <element elementId="44">
              <name>Language</name>
              <description>A language of the resource</description>
              <elementTextContainer>
                <elementText elementTextId="26609">
                  <text>English</text>
                </elementText>
              </elementTextContainer>
            </element>
          </elementContainer>
        </elementSet>
      </elementSetContainer>
    </collection>
    <elementSetContainer>
      <elementSet elementSetId="1">
        <name>Dublin Core</name>
        <description>The Dublin Core metadata element set is common to all Omeka records, including items, files, and collections. For more information see, http://dublincore.org/documents/dces/.</description>
        <elementContainer>
          <element elementId="50">
            <name>Title</name>
            <description>A name given to the resource</description>
            <elementTextContainer>
              <elementText elementTextId="26281">
                <text>Optimal Network Reconfiguration of the Distribution Network for Minimization of Power Loss and Voltage Deviation using NSGA-II Algorithm</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="96">
            <name>Author</name>
            <description>Author</description>
            <elementTextContainer>
              <elementText elementTextId="26282">
                <text>Lejla Bandić</text>
              </elementText>
              <elementText elementTextId="26283">
                <text>Naida Nalo</text>
              </elementText>
              <elementText elementTextId="26284">
                <text>Jasmin Kevrić</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="94">
            <name>Abstract</name>
            <description>A summary of the resource.</description>
            <elementTextContainer>
              <elementText elementTextId="26285">
                <text>Abstract - For efficient power flow in the distribution system, one of the most important components is the accurate loss and voltage deviation minimization. This work gives an insight into the minimization process of the loss and voltage deviation in the power distribution system by using the network reconfiguration. Non-dominated sorting genetic algorithm – II (NSGA-II) has been used as the tool for solving the problem. The power flow is incorporated with the Genetic Algorithm until the best results are obtained. For power flow calculation, Power System Analysis Toolbox (PSAT) is used, while the whole algorithm was written in MATLAB. The proposed algorithm is tested on a real 10 kV distribution network, Gracanica. Results obtained show that the presented methodology can be efficiently applied to reconfigure distribution networks to select the optimal distribution network topology in order to achieve savings in power losses and improve voltage profile.</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="97">
            <name>Keywords</name>
            <description>Keywords.</description>
            <elementTextContainer>
              <elementText elementTextId="26286">
                <text>Keywords - Non-dominated sorting genetic algorithm–II (NSGA-II), reconfiguration, power loss, voltage profile, PSAT.&#13;
</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="45">
            <name>Publisher</name>
            <description>An entity responsible for making the resource available</description>
            <elementTextContainer>
              <elementText elementTextId="26287">
                <text>International Burch University</text>
              </elementText>
            </elementTextContainer>
          </element>
          <element elementId="48">
            <name>Source</name>
            <description>A related resource from which the described resource is derived</description>
            <elementTextContainer>
              <elementText elementTextId="26288">
                <text>Journal of Natural Sciences and Engineering</text>
              </elementText>
            </elementTextContainer>
          </element>
        </elementContainer>
      </elementSet>
    </elementSetContainer>
  </item>
</itemContainer>
