Metadata-Version: 1.1
Name: rawdata
Version: 0.0.9
Summary: Generate realistic raw datasets with optional DQ issues
Home-page: https://github.com/acutesoftware/rawdata
Author: Duncan Murray
Author-email: djmurray@acutesoftware.com.au
License: GNU General Public License v3 (GPLv3)
Description: =========================================
        Raw Data
        =========================================
        
        .. image:: https://badge.fury.io/py/rawdata.svg
            :target: http://badge.fury.io/py/rawdata 
        
        .. image:: https://landscape.io/github/acutesoftware/rawdata/master/landscape.svg?style=flat
           :target: https://landscape.io/github/acutesoftware/rawdata/master
           :alt: Code Health    
            
        Generate realistic raw datasets with optional DQ issues
        
        To install run 
        
        .. code:: python
        
            pip install rawdata
        
        
        
        Basic Usage
        ----------------
        
        Create a random table
        
        .. code:: python
        
            import rawdata.generate
            colLabel = ['Year', 'Name',   'Born']
            colTypes = ['DATE', 'PEOPLE', 'PLACE']
            tbl = rawdata.generate.TableGenerator(3, colTypes, colLabel)
            print(tbl)
        
                > Year, name,    Born
                > 2013, Douglas, Scandinavia
                > 1999, Hunter,  Sierra Leone
                > 2005, Shubha,  Madagascar
                
        Adding Errors to a table
        
        
        .. code:: python
        
            import rawdata.errors
            t = rawdata.errors.TableWithErrors(tbl, 'BAD_STRING')
            t.add_errors(2)
            print(t.tbl)
        
        And after adding 2 random errors there are additional spaces in Douglas, and the Born column is missing for Hunter
        
        
        .. code:: python
        
            Year    Name       Born
            -----   ---------  ----------
            2013     Douglas   Scandinavia
            1999    Hunter      
            2005    Shubha     Madagascar
        
        You can use columns generated via a custom list
        
        .. code:: python
        
        
            custom_list = ['Carved Statue', '1984 Volvo', '2 metre Ball of string']
            tbl = TableGenerator(5, ['PEOPLE', 'INT', custom_list], ['Name', 'Age', 'Fav Possession'])
            print(tbl)
                > Name,   Age,  Fav Possession
                > Inez,    58,  Carved Statue
                > Zane,    50,  2 metre Ball of string
                > Jered,   49,  1984 Volvo
                > Tameron, 55,  2 metre Ball of string
                > Wyatt,   68,  Carved Statue
        
        Other functions 
        
        .. code:: python
        
            import rawdata.generate
            n = rawdata.generate.NumberGenerator
            s = rawdata.generate.StringGenerator
        
            print('Random Number    = ', n.random_int(1,100))
                > Random Number    =  84
        
            print('Random Letters   = ', s.random_letters(40))
                > Random Letters   =  T1CElkRAGPAmWSavbDItDbFmQIvUh26SyJE58x49
        
            print('Random Password  = ', s.generate_password())
                > Random Password  =  peujlsmbf19966YKCX
        
            words = rawdata.generate.get_list_words()
            print(len(words), ' words : ', words[500:502])
                > 10739  words :  ['architeuthis', 'arcsine']
        
            places = rawdata.generate.get_list_places()
            print(len(places), ' places : ', places[58:60])
                > 262  places :  ['Brazil', 'British Virgin Islands']
        
        
        
        More information is at https://github.com/acutesoftware/rawdata
        
        
        
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Environment :: Web Environment
Classifier: Programming Language :: Python :: 3.4
Classifier: Intended Audience :: Developers
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Games/Entertainment :: Simulation
Classifier: Topic :: Software Development :: Libraries :: Application Frameworks
