Clean

Last updated 11 months ago

File Cleaning attempts to locate the real data in an input file and produce a cleaned copy, free of leading header information and extra newlines.

File Cleaning

File cleaning is integrated into every layer of Transforms. Although there is a designated section for cleaning and re-downloading files, every file that is uploaded is automatically cleaned prior to use.

This means that even if you drop a "dirty" sample property file into a new transform, it will still be cleaned before any transformations are applied.

Options

Several options can be set before cleaning the file.

  • "Remove rows where column A is blank?" - This option filters out rows. It checks the first column (Column A) of each row and removes the entire row if that cell is empty or contains no data.

  • "Remove rows where amount (sum) is zero" - Removes rows where the sum of all budget, amount, and total columns is zero. Useful for financial files.

  • Header Row - Auto-Detect is the default setting and tries to identify header information automatically before processing the data. If you know the precise location of your data or if the auto-detection is incorrect, you can specify the exact row here before uploading your file.

  • Custom File Process - These custom processes are developed by the Perigee Team and consist of specific, targeted operations designed to convert poorly formatted data or various formats into a format that is compatible with the Perigee Transforms application.

    • If you would like your own processes written, or have custom processes you would like loaded into the system and available for your team, please contact us at sales@perigee.software.
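To make the two row filters above concrete, here is a minimal Python sketch of the kind of logic they describe. It is not the Transforms implementation: the function names, the keyword list used to spot amount-like columns, and the rule that non-numeric cells count as zero are all assumptions made for illustration.

```python
def remove_blank_column_a(rows):
    """Keep only rows whose first cell (Column A) has non-whitespace content."""
    return [row for row in rows if row and row[0].strip()]


def _to_number(cell):
    """Parse a cell as a float; blank or non-numeric cells count as 0 (assumption)."""
    try:
        return float(cell.replace(",", "").strip())
    except ValueError:
        return 0.0


def remove_zero_sum_rows(header, rows, keywords=("budget", "amount", "total")):
    """Drop rows whose amount-like columns sum to zero.

    Amount-like columns are guessed from the header names (hypothetical rule).
    """
    amount_idx = [
        i for i, name in enumerate(header)
        if any(key in name.lower() for key in keywords)
    ]
    if not amount_idx:
        # Nothing looks like an amount column, so leave the rows untouched.
        return rows
    return [
        row for row in rows
        if sum(_to_number(row[i]) for i in amount_idx if i < len(row)) != 0
    ]
```

Note that a row holding offsetting values (for example 100 and -100) also sums to zero and would be removed by the second filter, which matches the "sum" wording of the option.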

A demo Property File

In this example, our file contains extra "garbage" information above the real data, much like the real "client files" we've all dealt with. It also has extra newlines between rows and is generally too messy for an automated system to read reliably.

After going through the clean process, it comes out as a clean, no-nonsense file that most automated programs can read.
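As a rough illustration of what such a clean pass involves, the sketch below drops blank lines and discards everything above the header row. The auto-detect heuristic (the first row with the most non-empty cells) is an assumption chosen for this example, not the detection logic Transforms uses, and `clean_csv_text` is a hypothetical name.

```python
import csv
import io


def non_empty(row):
    """Count the cells in a row that contain non-whitespace text."""
    return sum(1 for cell in row if cell.strip())


def clean_csv_text(text, header_row=None):
    """Return CSV text with blank lines and pre-header rows removed.

    header_row is a 1-based row number in the original file. When it is None,
    the header is guessed as the first row with the most non-empty cells
    (an illustrative heuristic only).
    """
    numbered = list(enumerate(csv.reader(io.StringIO(text)), start=1))
    # Remove rows that are empty or contain only whitespace cells.
    kept = [(n, row) for n, row in numbered if non_empty(row) > 0]
    if not kept:
        return ""
    if header_row is None:
        widest = max(non_empty(row) for _, row in kept)
        header_row = next(n for n, row in kept if non_empty(row) == widest)
    # Keep the header row and everything after it.
    data = [row for n, row in kept if n >= header_row]
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(data)
    return out.getvalue()
```

Passing an explicit `header_row` mirrors the Header Row option: it skips the guess and keeps only rows from that row number onward.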

Property.csv (614B): Demo property file