question

Upvotes
Accepted
2 0 0 2

I want to get data using python API for multiple fields and RICs for a time period

The output that the get_history function returns is not in the best format. The first column contains dates, the first row contains RICs, and the second row contains fields. I want to have the data in the following format: the first column should contain dates, the second column should contain RICs, and the other columns should contain the field values corresponding to the date and RIC for each observation. How can I format it this way in Python? Additionally, there is another problem: the get_history function sometimes returns multiple rows for the same date and RIC, with some field values displayed in the first row and some field values displayed in the second row. I don't know whether this is a bug or a feature. The output format I'd like to have is in the picture.

1716699640313.png

eikon-data-apipython#technology#contentdata-type
1716699640313.png (114.9 KiB)
icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

@vsafak

Please share the code that you are using to retrieve the data.

df = rd.get_history(universe=["SPYQ202454000.U^E24", "SPYQ202453900.U^E24"], fields= ['TR.OPENPRICE', 'TR.HIGHPRICE', 'TR.LOWPRICE', 'TR.CLOSEPRICE', 'TR.OPENINTEREST', 'TR.IMPLIEDVOLATILITY','TR.DELTA','TR.THETA','TR.GAMMA','TR.VEGA','TR.RHO'],

interval="1d",

start="2024-01-01",

end="2024-06-01")

1 Answer

· Write an Answer
Upvotes
Accepted
14.7k 30 5 10

Hi @vsafak ,

The dataframe can be formatted with Python code, for example

df = rd.get_history(universe=["SPYQ202454000.U^E24", "SPYQ202453900.U^E24"], fields= ['TR.OPENPRICE', 'TR.HIGHPRICE', 'TR.LOWPRICE', 'TR.CLOSEPRICE', 'TR.OPENINTEREST', 'TR.IMPLIEDVOLATILITY','TR.DELTA','TR.THETA','TR.GAMMA','TR.VEGA','TR.RHO'],
                    interval="1d",
                    start="2024-01-01",
                    end="2024-06-01")

format_df = pd.DataFrame()
idx = pd.IndexSlice
first_loop = True

for ric in rics:
    # get the dataframe of each RIC in the RICs list
    temp_df = df.loc[:,idx[ric,:]]
    temp_df.columns = temp_df.columns.droplevel()
    temp_df.reset_index(inplace=True)
    temp_df['RIC'] = ric

    # put this value in the format_df dataframe
    if first_loop:
        format_df = temp_df
        first_loop = False
    else:
        format_df = format_df.append(temp_df)
display(format_df)

1716909844640.png


1716909844640.png (48.6 KiB)
icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Thanks. For those who are using pandas 2.0 or above, .append was replaced by ._append. After this minor modification, the code above works.

Write an Answer

Hint: Notify or tag a user in this post by typing @username.

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.