What services does ISCON Statistical Consulting Services offer?

ISCON provide a wide range of statistical consulting and analysis services which include but not limited to the selection of research design, data analysis and results interpretation. We can also assist you in designing, conducting and analysing survey research. Our expert assistance is available for an extensive range of areas including grant proposal development, clinical trial design, scientific research design, strategies for data collection, determination of sample size and power, statistical modelling, results interpretation, preparation of manuscripts, data entry and management to some extent. We are specialists in epidemiology studies, clinical trials, statistical genetics, non-parametric methodology, genetic epidemiology, regression and time series analysis, Bayesian methods and graphical methodologies. Moreover, we offer support to non-profit organisations for NSF, private foundation and NIH grants.

How can I book appointment with Statistical consultant?

You can contact us via email or phone call to book your appointment for consultation

What information should I bring along at an initial consultation meeting with statistician?

When you visit us for the initial meeting, it is recommended that you bring along all the information related to your research and can help us in better understanding the goals and purpose of your research study. This information can include research hypotheses, relevant literature review papers, proposal drafts or manuscripts and anything you think is important for us to know about. If you have done data collection already, we request you to give us your data set copy in electronic form along with all the relevant study information.

How much statistics should I know?

It is expected that you at least have basic knowledge of statistical concepts and methodologies. However, our expert consultant helps you to explore the available options.

How much is the cost to hire statistician?

The cost of our service depends on a variety of factors which include your data format, complexity, data cleanliness, project deliverables and required analysis. We work with you to understand your requirements and breakdown the cost to meet your individual needs.

What happens at the initial consultation session with statistician?

In the initial consultation session, the consultant introduces him/herself and asks you a few general questions about your research, data, objectives and requirements. Through this, we try to understand your needs and work with you to answer your concerns.

What is the cost of the initial consultation?

At ISCON Statistics, we provide a free initial consultation at our office or through video-calling. However, if you request us to come over to you, then you have to cover our trip expenses.

At what research stage should I contact statisticians?

The best time to contact statistician is before designing your research study. We can help you in designing your research study and determine the most effective and powerful statistical methodologies. However, you can contact us at any stage of your research.

Which statistical software statistician will use for data analysis?

Our experts have hands-on experience of using a variety of latest statistical software and tools, including R, WinBUGS, SPSS, SAS, and Stata.

What is your approach towards data privacy, security and confidentiality?

At ISCON Statistics, we follow strict data security measures and respect your privacy and confidentiality. We never share your information and data with anyone without your permission.

Are your services available for students?

Yes, we happily provide statistical support to the postgraduate students who do not have statistical expertise. We either provide them statistical advice or guidance to help them perform their statistical analysis or perform the statistical analysis of their research.

What format should I use to provide you data?

You can send us data in any format. However, CSV files, Excel files or SQL-based data files are recommended.

Reshape data from wide to long or from long to wide in R

Chetan Prajapati

Founder & Statistician at ISCON

Reshaping though frequently required in data analysis, so often it remains confusing even if you are frequent user ofreshapefunction. Here I have provided an simple example to elaborate more on each argument ofreshape.

Table of Contents

Reshape data from wide to long
Reshape data from long to wide

Reshape data from wide to long

Your data in wide form if the multiple observations of item, place or person (i.e. units) has been recorded in single row (but in multiple column). This multiple observations may be of repeated measure type (observation are made repeatedly at different time point) or multiple characteristics of some unit (eg. height, length and width of square).

Let’s take an example of wide data of repeated measure type. Person A is visiting clinic every month for their blood pressure check, and nurse note down the value for each month in single row which belong to this specific person A. See example below,

# wide data
df <- data.frame(matrix(data = NA, nrow = 3, ncol = 5, dimnames = list(NULL, paste0(c("id","name","jan","feb","march")))))
df[1,] <- c(1,"A",123,120,125)
df[2,] <- c(2,"B",140,150,155)
df[3,] <- c(3,"C",96,86,97)

Our “wide” data look like

knitr::kable(df, caption = "wide data")

Table 1:wide data
id	name	jan	feb	march
1	A	123	120	125
2	B	140	150	155
3	C	96	86	97

We want a data in which each new observation in new row but within same column i.e. long data. To achieve that we need to use standardreshapecommand inR. Thereshapehas following argument:

idvar: unique identifier for person,place or object on which observations(measurements) are made at different time points or repeatedly. ExampleCase ID
varying: if observation for specific individuals are made at different time points, in which columns values are recorded i.e time-varying columns. ExampleJan,Feb,March
timevar: what will be the name of column once the time-varying columns above has been staked in rows. ExampleMonth
times: what will be the values (of time) once the time-varying columns above has been staked in rows. ExampleJan,Feb,March
v.names: what will be the values (of observations) once the time-varying columns above has been staked in rows. ExampleBP
direction: data needs to converted from wide tolongformat.

df_long <- reshape(df,
idvar = "id",  
        #[unique identifier for person,place or object on which observations(measurments) are made at different time points or repeatedly]

varying = c("jan","feb","march"),  
        # [if observation for specific individulas are made at different time points, in which columns values are recorded i.e time-varying columns ]

timevar = "month", 
        # [what will be the name of column once the timevarying columns above has been staked in rows]

times = c("jan","feb","march"),
        # [what will be the values (of time) once the timevarying columns above has been staked in rows]

v.names = "BP", 
        # [what will be the values (of observations) once the timevarying columns above has been staked in rows]

direction = "long")   
        # [we want to convert wide df into long one])

Our “long” data look like

df_long <- arrange(df_long, id)

kable(df_long,format = "pandoc", caption = "long data")

Table 2:long data
id	name	month	BP
1	A	jan	123
1	A	feb	120
1	A	march	125
2	B	jan	140
2	B	feb	150
2	B	march	155
3	C	jan	96
3	C	feb	86
3	C	march	97

Sometime, not only one type of measurement (BP) but also other types (such as heart rate-HR) are measured and recorded row wise. For example,

# wide data
df <- data.frame(matrix(data = NA, nrow = 3, ncol = 8, dimnames = list(NULL, paste0(c("id","name","BP_jan","BP_feb","BP_march","HR_jan","HR_feb","HR_march")))))
df[1,] <- c(1,"A",123,120,125,72,70,71)
df[2,] <- c(2,"B",140,150,155,85,82,86)
df[3,] <- c(3,"C",96,86,97,65,52,59)

kable(df,format = "pandoc", caption = "wide data- multiple category")

Table 3:wide data- multiple category
id	name	BP_jan	BP_feb	BP_march	HR_jan	HR_feb	HR_march
1	A	123	120	125	72	70	71
2	B	140	150	155	85	82	86
3	C	96	86	97	65	52	59

This data can be converted into “long” by usinglistfor group of time-varying columns forvarying

df_long <- reshape(df,
idvar = "id",  
varying = list(c("BP_jan","BP_feb","BP_march"),c("HR_jan","HR_feb","HR_march") ),  
timevar = "month", 
times = c("jan","feb","march"),
v.names = c("BP","HR"), 
direction = "long")

df_long <- arrange(df_long, id)
kable(df_long,format = "pandoc", caption = "long data")

Table 4:long data
id	name	month	BP	HR
1	A	jan	123	72
1	A	feb	120	70
1	A	march	125	71
2	B	jan	140	85
2	B	feb	150	82
2	B	march	155	86
3	C	jan	96	65
3	C	feb	86	52
3	C	march	97	59

Reshape data from long to wide

To make data “wide” from long, thereshapefunction will need only two main arguments

idvar: unique identifier of unit on which measurement are made
timevar: which column representthe timingof the observations ( so thatreshapefunction associate it with the value for given time for each ID )

If you do not specify above two arguments, function will drop an error-

Error in [.data.frame (data, , idvar) : undefined columns selected

If you read above error carefully, it already specifying which arguments were missing. Here in above case missing argument wasidvar.

You can optionally provide,

v.names: which column representvaluesof the observations in long data (so thatreshapefunction can transform these values into rows for each ID)
sep: column names in wide format are going to be created using value oftimesvarandintegers. Specify how both will be seperated in column names.

Here is the example

df_wide <- reshape(df_long,
       idvar = "id",
       # unique identifier
       timevar = "month",
       # the column represent the timing of the observations
       v.names = c("BP","HR"),
       # the columns represent the value of the observation (BP,HR)
       direction = "wide",
       sep = "_"
        )

Here is the wide data

Table 5:wide data
	id	name	BP_jan	HR_jan	BP_feb	HR_feb	BP_march	HR_march
1	1	A	123	72	120	70	125	71
4	2	B	140	85	150	82	155	86
7	3	C	96	65	86	52	97	59

Reshape data from wide to long or from long to wide in R

Reshape data from wide to long

Reshape data from long to wide

Analytics cookies

Functining Cookies

Marketing cookies

Reshape data from wide to long

Reshape data from long to wide

Cookie Settings

Analytics cookies

Functining Cookies

Marketing cookies