Re: [EpiData-list] Missing data values when exporting to Stata
I will download the Analysis app and try that. I will also try to export to Stata 14 as the Stata version does not matter with regards to Python. Thanks
Kind Regards Henning Langen Stokmo
Consultant (Nuclear Medicine) // Department of Nuclear Medicine, Oslo University Hospital Radiology Resident // Department of Medical Imaging, Vestre Viken HF, Drammen Hospital PhD-student // Department of Nuclear Medicine, Oslo University Hospital | Oslo University
Functional and Molecular Imaging Research Group (https://www.ous-research.no/revheim/) ResearchGate: https://www.researchgate.net/profile/Henning_Langen_Stokmo
Sent from Canary (https://canarymail.io/)
On Friday, Aug 23, 2019 at 7:00 PM, <epidata-list-request@lists.umanitoba.ca (mailto:epidata-list-request@lists.umanitoba.ca)> wrote: Send EpiData-list mailing list submissions to epidata-list@lists.umanitoba.ca
To subscribe or unsubscribe via the World Wide Web, visit http://lists.umanitoba.ca/mailman/listinfo/epidata-list or, via email, send a message with subject or body 'help' to epidata-list-request@lists.umanitoba.ca
You can reach the person managing the list at epidata-list-owner@lists.umanitoba.ca
When replying, please edit your Subject line so it is more specific than "Re: Contents of EpiData-list digest..."
EpiData-list mailing list ___________________________________
Today's Topics:
- Re: Missing data values when exporting to Stata
(EpiData development and support) 2. Re: Missing data values when exporting to Stata (EpiData development and support)
Message: 1 Date: Thu, 22 Aug 2019 23:40:02 +0200 From: EpiData development and support epidata-list@lists.umanitoba.ca To: epidata-list@lists.umanitoba.ca Subject: Re: [EpiData-list] Missing data values when exporting to Stata Message-ID: 041A2CAB-A14A-4F26-8389-3410D8489A76@gmail.com Content-Type: text/plain; charset=utf-8
Sorry for not giving any information about the versions and system I use.
I use EpiData Manager v4.4.2.1 on a MacBook Pro 13? (2018) running Mojave 10.14.6 to export. The exported files are to Stata v12. The .dta files are opened in Python v3.6.8 via PyCharm IDE CE 2019.2 and imported to a Panda dataframe with the command ?pd.read_stata(<filename>). The empty lines in the Stata-file gets correctly set as ?NaN? in Pandas, but the value label for the missing category is set to ?100.0? instead of ?NaN? as I expect it should be after coding this in EpiData.
Kind Regards Henning L. Stokmo Phd-Student, Oslo University Hospital
On 22 Aug 2019, at 19:00, epidata-list-request@lists.umanitoba.ca wrote:
Send EpiData-list mailing list submissions to epidata-list@lists.umanitoba.ca
To subscribe or unsubscribe via the World Wide Web, visit http://lists.umanitoba.ca/mailman/listinfo/epidata-list or, via email, send a message with subject or body 'help' to epidata-list-request@lists.umanitoba.ca
You can reach the person managing the list at epidata-list-owner@lists.umanitoba.ca
When replying, please edit your Subject line so it is more specific than "Re: Contents of EpiData-list digest..."
EpiData-list mailing list ___________________________________
Today's Topics:
- Re: Missing data values when exporting to Stata
(EpiData development and support)
Message: 1 Date: Wed, 21 Aug 2019 21:43:41 +0200 From: EpiData development and support epidata-list@lists.umanitoba.ca To: epidata-list@lists.umanitoba.ca Subject: Re: [EpiData-list] Missing data values when exporting to Stata Message-ID: CE3439F1-72F7-4F8A-90F1-313DF876BD64@epidata.dk Content-Type: text/plain; charset=utf-8
As I remember the Stata value should be .a But please specify which version of Stata and if you are using Manager export or the new analysis, which may directly save the file in Stata format.
Best wishes Jens Lauritsen EpiData Association Denmark
Den 21. august 2019 14.04.31 CEST, EpiData development and support epidata-list@lists.umanitoba.ca skrev:
Hi,
I have some missing data that is encoded as ?9? in the database and in the value label list it is flagged as missing. When exporting to CSV the value is obviously set as ?9? as expected.
When I export it as Stata I assume this value ?9? gets encoded to Stata? definition of a missing value, but when I read the data in Python with the ?pd.read_stata?-function it defines the category value ?9? as 100.0 and not as missing data (NaN) like the blanks in EpiData.
Eksample One column has 80 values. 19 of these are encoded as ?9? (missing), 58 of these are blank, and 3 are coded as ?1? (Yes) in EpiData. After exporting to Stata and reading in Python I get
19 values in category ?100? 3 values in category ?Yes? 58 values NaN (i.e. missing)
Why aren?t the ?9? category correctly coded as missing when exporting to Stata, or am I missing something here?
Kind Regards Henning _______________________________________________ EpiData-list mailing list EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
-- Sent from my Android device with K-9 Mail. Please excuse my brevity.
EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
End of EpiData-list Digest, Vol 183, Issue 2
Message: 2 Date: Fri, 23 Aug 2019 11:21:07 +0200 From: EpiData development and support epidata-list@lists.umanitoba.ca To: epidata-list@lists.umanitoba.ca Subject: Re: [EpiData-list] Missing data values when exporting to Stata Message-ID: C7C018D0-96BA-45C8-8AFF-7F9FC92D31AF@epidata.dk Content-Type: text/plain; charset=utf-8
You spotted an error in export to Stata 12.
If you export to Stata 14 it works as desired.
A easier way is to use the new Analysis for export, Analysis has options for choice of Stata version a d works on a Mac.
Can the python use Stata 14?
Best wishes Jens Lauritsen EpiData Association Denmark
Den 22. august 2019 23.40.02 CEST, EpiData development and support epidata-list@lists.umanitoba.ca skrev:
Sorry for not giving any information about the versions and system I use.
I use EpiData Manager v4.4.2.1 on a MacBook Pro 13? (2018) running Mojave 10.14.6 to export. The exported files are to Stata v12. The .dta files are opened in Python v3.6.8 via PyCharm IDE CE 2019.2 and imported to a Panda dataframe with the command ?pd.read_stata(<filename>). The empty lines in the Stata-file gets correctly set as ?NaN? in Pandas, but the value label for the missing category is set to ?100.0? instead of ?NaN? as I expect it should be after coding this in EpiData.
Kind Regards Henning L. Stokmo Phd-Student, Oslo University Hospital
On 22 Aug 2019, at 19:00, epidata-list-request@lists.umanitoba.ca
wrote:
Send EpiData-list mailing list submissions to epidata-list@lists.umanitoba.ca
To subscribe or unsubscribe via the World Wide Web, visit http://lists.umanitoba.ca/mailman/listinfo/epidata-list or, via email, send a message with subject or body 'help' to epidata-list-request@lists.umanitoba.ca
You can reach the person managing the list at epidata-list-owner@lists.umanitoba.ca
When replying, please edit your Subject line so it is more specific than "Re: Contents of EpiData-list digest..."
EpiData-list mailing list ___________________________________
Today's Topics:
- Re: Missing data values when exporting to Stata
(EpiData development and support)
Message: 1 Date: Wed, 21 Aug 2019 21:43:41 +0200 From: EpiData development and support epidata-list@lists.umanitoba.ca To: epidata-list@lists.umanitoba.ca Subject: Re: [EpiData-list] Missing data values when exporting to Stata Message-ID: CE3439F1-72F7-4F8A-90F1-313DF876BD64@epidata.dk Content-Type: text/plain; charset=utf-8
As I remember the Stata value should be .a But please specify which version of Stata and if you are using
Manager export or the new analysis, which may directly save the file in Stata format.
Best wishes Jens Lauritsen EpiData Association Denmark
Den 21. august 2019 14.04.31 CEST, EpiData development and support
epidata-list@lists.umanitoba.ca skrev:
Hi,
I have some missing data that is encoded as ?9? in the database and
in
the value label list it is flagged as missing. When exporting to CSV the value is obviously set as ?9? as expected.
When I export it as Stata I assume this value ?9? gets encoded to Stata? definition of a missing value, but when I read the data in Python with the ?pd.read_stata?-function it defines the category
value
?9? as 100.0 and not as missing data (NaN) like the blanks in
EpiData.
Eksample One column has 80 values. 19 of these are encoded as ?9? (missing),
58
of these are blank, and 3 are coded as ?1? (Yes) in EpiData. After exporting to Stata and reading in Python I get
19 values in category ?100? 3 values in category ?Yes? 58 values NaN (i.e. missing)
Why aren?t the ?9? category correctly coded as missing when
exporting
to Stata, or am I missing something here?
Kind Regards Henning _______________________________________________ EpiData-list mailing list EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
-- Sent from my Android device with K-9 Mail. Please excuse my brevity.
EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
End of EpiData-list Digest, Vol 183, Issue 2
EpiData-list mailing list EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
-- Sent from my Android device with K-9 Mail. Please excuse my brevity.
EpiData-list@lists.umanitoba.ca http://lists.umanitoba.ca/mailman/listinfo/epidata-list
End of EpiData-list Digest, Vol 183, Issue 3
participants (1)
-
EpiData development and support