Stock Transactions Dataset
Description
As the number of investment transactions grows, so does the importance of visual analysis for studying financial data. Although modern stock market platforms and research tools offer a range of stock data and visual analysis software, retail investment data is difficult to find because of privacy and security concerns. This scarcity is a barrier for researchers and analysts interested in portfolio management, analysis, and visualization. This is the first open and anonymized dataset of investment transactions. It is freely accessible and can be used to study investment portfolio analysis, thereby improving strategic decision-making in portfolio management. It contains a comprehensive set of investment transactions focused on the U.S. stock market: the transaction records of one anonymous investor over 3-4 years, complemented by derived metadata on the stocks of interest. We are confident that this open data will contribute significantly to the research community and foster further exploration in the field of investment.
Steps to reproduce
The raw investment data consists of over 2,700 online brokerage transaction records over three years spanning 2020–2024.

Anonymization: We have taken steps to anonymize the data, including removing all identifier information such as name and original account numbers. Without an associated name or ID it is not possible to identify any individual retail investor, since there are billions of stock market transactions every day.

Each transaction features the following fields:
1. Action: the type of transaction that occurred:
   • Market Buy
   • Market Sell
   • Dividend: there are seven different types of dividends: bonus, demerger, dividends paid by foreign corporations, dividends paid by US corporations, ordinary manufactured payment, ordinary return of capital non-US, interest on cash
   • Deposit
   • Withdrawal
2. Time: the time of the transaction
3. ISIN: International Securities Identification Number, a unique code that identifies a globally tradable security
4. Ticker: the stock ticker symbol, a code used to uniquely identify a specific stock across trading platforms
5. Name: the name of the stock, typically the name of a company
6. No. of shares: the number of shares involved in the transaction
7. Price/share: the transaction price per share
8. Currency: the currency of the transaction
9. Exchange rate: the exchange rate at the time of the transaction
10. Result (GBP): the profit or loss result when selling stocks, in British pounds
11. Total (GBP): the total transaction amount in British pounds

The transaction records contain 10 additional fields providing data on tax fees, transaction IDs, and additional notes. A description of these is provided in the dataset itself.
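As a rough illustration of how the fields listed above might be used, the sketch below parses a few transaction rows and computes the total amount per action type and the net number of shares held per ticker. It is a minimal sketch under several assumptions: the records are available as CSV, the column headers and action labels match the field names above exactly, and empty cells mean "not applicable". The sample rows, the ticker `AAA`, the ISIN, and all helper function names are invented for illustration and are not taken from the dataset.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows. Column names follow the field list above;
# every value here is invented and does not come from the dataset.
SAMPLE_CSV = """Action,Time,ISIN,Ticker,Name,No. of shares,Price/share,Currency,Exchange rate,Result (GBP),Total (GBP)
Deposit,2021-01-04 09:00:00,,,,,,,,,500.00
Market Buy,2021-01-05 14:30:00,US0000000001,AAA,Example Corp,2,100.00,USD,1.25,,160.00
Market Sell,2021-03-01 15:00:00,US0000000001,AAA,Example Corp,1,120.00,USD,1.25,16.00,96.00
"""


def to_float(value):
    """Parse a numeric cell, treating empty or missing cells as 0.0."""
    value = (value or "").strip()
    return float(value) if value else 0.0


def load_transactions(handle):
    """Read CSV rows from an open text handle into a list of dicts."""
    return list(csv.DictReader(handle))


def total_by_action(rows):
    """Sum the 'Total (GBP)' column for each distinct 'Action' value."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["Action"]] += to_float(row["Total (GBP)"])
    return dict(totals)


def net_shares_by_ticker(rows):
    """Net shares per ticker: buys add shares, sells subtract them.

    Assumes the action labels are exactly 'Market Buy' and 'Market Sell';
    other actions (dividends, deposits, withdrawals) are ignored.
    """
    holdings = defaultdict(float)
    for row in rows:
        shares = to_float(row["No. of shares"])
        if row["Action"] == "Market Buy":
            holdings[row["Ticker"]] += shares
        elif row["Action"] == "Market Sell":
            holdings[row["Ticker"]] -= shares
    return dict(holdings)


rows = load_transactions(io.StringIO(SAMPLE_CSV))
print(total_by_action(rows))
print(net_shares_by_ticker(rows))
```

To run this against the real records, the `io.StringIO(SAMPLE_CSV)` handle would be replaced by an opened dataset file, for example `open(path, newline="", encoding="utf-8")`, after checking the actual headers and action labels, which may differ in spelling or capitalization from those assumed here.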
Institutions
- University of Nottingham