Crypto Like

Crypto Like

1606095254

What is UNION Finance (UNN) | What is UNN token

DeFi’s astonishing pace of innovation is accompanied by a theatre of speculation, increasingly complex and interwoven smart contracts, and lucrative exploits. Proper risk management solutions for investors shouldering principal risks are currently insufficient to cover the progressive influx of capital into the market, leaving retail investors at the peril of the Ethereum’s Dark Forest and institutions with a dearth of options for hedging risk.

The risks facing DeFi are multi-faceted, nuanced, and novel. Vulnerabilities span multiple layers of a composable web of DeFi lego blocks, of which many are intertwined to complex degrees. The market presents several distinct challenges for perceiving and modelling risk in a permissionless financial system. Without sufficient tooling and infrastructure for risk management, institutional and retail capital will not be able to manage risk at a competitive level to centralized finance.

Risk management and sophisticated asset protection instruments are the next great frontier for DeFi that can usher in a wave of new capital and support the existing community as well. The potential design space is enormous. By fusing conventional coverage modelling with bundled protection products and a no-KYC platform, UNION aims to lay the foundation for advanced risk management products and grant a higher degree of comfort to participants.

We’re excited to push the boundaries of scalable, open asset protection in the DeFi market, and welcome the broader crypto community to join us on our journey.

UNN Sale + Preregistration

The token sale of UNN will begin on Sunday, November 22, 2020. The demand-driven pricing curve will utilize a linear approximation to offer the token price from $0.035 (entry price of the sale) to $0.50 (exit price of the sale).

Details about the UNN Token Sale:

  • Expected Start Date: Sunday, November 22, 2020
  • 50M UNN tokens for sale (5% of total supply)
  • Starting price: $0.035
  • Ending price: $0.50
  • Minimum purchase: $100
  • Maximum purchase: $87k
  • KYC is required
  • You will need a Metamask account
  • Restricted countries: The United States of America, The Cayman Islands, New Zealand, Canada, People’s Republic of China, Singapore, Republic of Korea, Belarus, Burundi, Central African Republic, Cuba, Iran, Libya, Democratic People’s Republic of Korea, Somalia, Sudan, Darfur, Mali, Republic of Guinea, Republic of Guinea-Bissau, Syria, Venezuela, Zimbabwe
  • No UNN sale participant or token holder will be able to sell or transfer their tokens until the sale has concluded.

Please have MetaMask installed in your browser BEFORE you begin the process. It is also recommended that you only use Chrome, Brave, or Firefox as your web browser with the MetaMask extension.

Shortly, we will be releasing an article with instructions and further details about participating in the sale.

Inspired by similar work by Ampleforth, following the sale, we will launch the UNN Geyser, which will reward liquidity providers of UNN tokens to Uniswap with additional UNN tokens for their active participation. More details about that below.

Characteristics of UNN

The UNN token is used for governance purposes, such as voting on protection claims and related conflict resolution, adjusting risk parameters, or adjusting incentive programs. Early token holders will have the ability to participate in a liquidity mining program (e.g., the UNN Geyser) and a Voluntary Lockup Arrangement, where bolstered incentives beyond the UNN Geyser will be granted based on fixed lock-up periods.

Details for both the UNN Geyser and the Voluntary Lockup Arrangement are in the next section.

There will be a fixed total of 1 billion UNN tokens. The UNN token forms the basis for the UNION governance ecosystem, driving the incentives for liquidity programs, yield farming, and protection coverage across the platform. The token distribution is as follows:

  • 300 million UNN provisioned for the network, ecosystem, and marketing. Examples include bug bounties, audits, community outreach, development of future products, etc.
  • 93.333 million UNN have been sold in the first round, sold at $0.0075 / UNN
  • 200 million UNN have been sold in the second round, sold at $0.0125 / UNN
  • 43.2million UNN have been sold in the third round, sold at $0.025 / UNN
  • 113.467 million UNN reserved for the treasury.
  • 50 million UNN provisioned for the November 22, 2020, UNN sale.
  • 200 million UNN reserved for the team and advisors.

Token Locks

UNN is a governance token with aspirations of being a staple DeFi token that is included in major DeFi platforms and protocols because of its deep liquidity. To achieve this we designed and built the project from the beginning to have no mechanical locks (meaning they can be moved freely by holders) beyond delivery following the public sale. Early investors have been selected based on their long term vision with the project.

UNN + pUNN + uUNN

The UNN token is the centrepiece of a multi-layered system using a three-token model.

The complimentary tokens to UNN are uUNN and pUNN. The UNN token is the only token available during the token sale. The uUNN and pUNN tokens are used within the capital and pricing model of the platform to divest the governance process from the UNION asset protection model. Separate tokens for governance and protection remove conflicts of interest arising from complex market dynamics, such as those experienced by NXM in September 2020.

Buyers of protection will receive uUNN tokens giving them rights to the protection policy. Writers of protection will receive pUNN tokens, representing the % share of the protection pool that the writer is powering. The pUNN protection pool incentives program is detailed below following the Voluntary Lockup Arrangement section.

More details about how the multi-layered governance, claims assessment, and protection process function are available in the official UNION whitepaper.

Geyser

To incentivize on-chain liquidity providers of UNN (starting with Uniswap), we will provide a UNN Geyser. We built the UNN Geyser using AmpleForth’s Geyser as our base reference.

  1. The program will run for 1 year with 3-month phases.
  2. Phase 1: 20% APY for 3 months.
  3. Phase 2: To be announced.
  4. The launch pairs on Uniswap will be UNN/ETH and UNN/USDT.

Voluntary Lock Incentive

This lock incentivizes community participation in the early days of the project when governance activity is expected to be muted. In other words, think of this as a precursor to our governance incentive program. In return for locking their tokens, during which they cannot exchange/send the tokens, the purchasers will receive enhanced UNN incentives above the Geyser amount.

  1. The lock-up periods will be instituted based on limited windows of specific days. Locks will be offered in 30, 60, and 120-day terms. The goal is to encourage the alignment of incentives for the community in the long-term.
  2. The locked balance will be compounded daily and deposited (locked) into their wallet.
  3. Once the lock is lifted, the purchaser can exchange/send all tokens (original amount + interest).

A comparison of the different lock returns along with Geyser (no lock) for theoretical 1,000 token allocations.

Image for post

Protection Writer (pUNN) Incentives

Image for post

In return for providing the liquidity for a protection pool, protection writers or pUNN tokens holders will receive for their particular protection pool:

  1. 80% of the premium paid by buyers of protection offered by the protection pool. pUNN will receive the payment in whatever the premium token is. As per the whitepaper, we will start with accepting payments in stablecoin, like DAI.
  2. The remaining 20% will be split as such:
  3. 10% to a reserve fund for the pool, aka the “rainy day” fund.
  4. 10% as foundation operations and reserve pool.
  5. 50% of returns generated by reserve pools for the pUNN’s protection pool, which hold funds in excess of MCR for said pool.
  6. The remaining 50% will be split as such:
  7. 40% deposited back into the reserve fund.
  8. 10% as foundation operations and reserve pool.

Future Incentives

  1. Governance: in tandem with our products going live, we will launch our governance incentive where UNN holders can lock their tokens for voting power and boosts to staking and/or pool returns.
  2. Lending: to launch in tandem with our OC protection. People can borrow against their UNN holdings.
  3. **Other: **we will run discretionary incentives and governance voted incentives as suited for the benefit of the ecosystem.

Roadmap

Our roadmap reflects a balance between meeting the pressing market demand for a suite of complete protection products and providing a robust and well-tested solution. Given the nature of the technology solution, we will always err on the side of conservative. Below roadmap is our aspirational target and may be adjusted based on progress and market demand.

Q4 2020:

  • UNN Sale
  • Private Development / Testing
  • DAO strategic partner acquisition

Q1 2021:

  • Production launch of Smart Contract Protection
  • Production launch of Over-Collateralization
  • Lending program
  • Governance launch

Q2 2021:

  • Production launch of Single-Step Gas Protection
  • Production launch of Layer-1 Protection (Protocol TBD)
  • A secondary market for protection

H2 2021 and beyond:

  • Advanced models for better risk management/pricing
  • New Protection: Multi-Step Gas, Bundled, Impermanent Loss (IL)
  • Expanded Protection coverage
  • Protection / Risk Management analytics for institutional entities

UNION Best Practices & Compliance Standards

Our general thesis on best practices is to deploy a series of state-of-the-art checks to ensure regulatory compliance and the privacy of user data. This begins even before our token sale, with our initial sales.

Steps that we take to ensure compliance and user data privacy include:

  • Thorough KYC/AML is required for all UNN sales. This prevents wallet taint from funding not recorded on the blockchain prior to the token sale as well as validating purchasers during the token sale. UNN utilizes the expertise of 3rd party KYC/AML parties to ensure the security and sanctity of funds received.
  • Multi-Vector KYC/AML — Liveliness, documents, and mobile phone verification. We utilize a 4-point check, including liveliness, which detects whether the person in front of the camera is real or just a picture of a picture.
  • GDPR Compliance — We maintain data using Cayman Data Retention Standards with the view to meeting GDPR compliance requirements. We have invested heavily in network and server security infrastructure, including intrusion detection, and system/network monitoring for all systems that collect and maintain purchaser data as part of the sale.
  • Geofencing — Our token sales site is geofenced, meaning that connection (and participation) to the sale are blocked from restricted countries. These include:
  • The United States of America, The Cayman Islands, New Zealand, Canada, People’s Republic of China, Singapore, Republic of Korea, Belarus, Burundi, Central African Republic, Cuba, Iran, Libya, Democratic People’s Republic of Korea, Somalia, Sudan, Darfur, Mali, Republic of Guinea, Republic of Guinea-Bissau, Syria, Venezuela, and Zimbabwe
  • Wallet Check — We vet wallets to ensure they are not a party to illicit activity.

Would you like to earn token right now! ☞ CLICK HERE

Looking for more information…

☞ Website
☞ Explorer
☞ Source Code
☞ Social Channel
☞ Documentation
☞ Coinmarketcap

Create an Account and Trade NOW

Bittrex
Poloniex
Binance

Thank for visiting and reading this article! I’m highly appreciate your actions! Please share if you liked it!

#bockchain #bitcoin #union finance #unn

What is GEEK

Buddha Community

What is UNION Finance (UNN) | What is UNN token
Crypto Like

Crypto Like

1606095254

What is UNION Finance (UNN) | What is UNN token

DeFi’s astonishing pace of innovation is accompanied by a theatre of speculation, increasingly complex and interwoven smart contracts, and lucrative exploits. Proper risk management solutions for investors shouldering principal risks are currently insufficient to cover the progressive influx of capital into the market, leaving retail investors at the peril of the Ethereum’s Dark Forest and institutions with a dearth of options for hedging risk.

The risks facing DeFi are multi-faceted, nuanced, and novel. Vulnerabilities span multiple layers of a composable web of DeFi lego blocks, of which many are intertwined to complex degrees. The market presents several distinct challenges for perceiving and modelling risk in a permissionless financial system. Without sufficient tooling and infrastructure for risk management, institutional and retail capital will not be able to manage risk at a competitive level to centralized finance.

Risk management and sophisticated asset protection instruments are the next great frontier for DeFi that can usher in a wave of new capital and support the existing community as well. The potential design space is enormous. By fusing conventional coverage modelling with bundled protection products and a no-KYC platform, UNION aims to lay the foundation for advanced risk management products and grant a higher degree of comfort to participants.

We’re excited to push the boundaries of scalable, open asset protection in the DeFi market, and welcome the broader crypto community to join us on our journey.

UNN Sale + Preregistration

The token sale of UNN will begin on Sunday, November 22, 2020. The demand-driven pricing curve will utilize a linear approximation to offer the token price from $0.035 (entry price of the sale) to $0.50 (exit price of the sale).

Details about the UNN Token Sale:

  • Expected Start Date: Sunday, November 22, 2020
  • 50M UNN tokens for sale (5% of total supply)
  • Starting price: $0.035
  • Ending price: $0.50
  • Minimum purchase: $100
  • Maximum purchase: $87k
  • KYC is required
  • You will need a Metamask account
  • Restricted countries: The United States of America, The Cayman Islands, New Zealand, Canada, People’s Republic of China, Singapore, Republic of Korea, Belarus, Burundi, Central African Republic, Cuba, Iran, Libya, Democratic People’s Republic of Korea, Somalia, Sudan, Darfur, Mali, Republic of Guinea, Republic of Guinea-Bissau, Syria, Venezuela, Zimbabwe
  • No UNN sale participant or token holder will be able to sell or transfer their tokens until the sale has concluded.

Please have MetaMask installed in your browser BEFORE you begin the process. It is also recommended that you only use Chrome, Brave, or Firefox as your web browser with the MetaMask extension.

Shortly, we will be releasing an article with instructions and further details about participating in the sale.

Inspired by similar work by Ampleforth, following the sale, we will launch the UNN Geyser, which will reward liquidity providers of UNN tokens to Uniswap with additional UNN tokens for their active participation. More details about that below.

Characteristics of UNN

The UNN token is used for governance purposes, such as voting on protection claims and related conflict resolution, adjusting risk parameters, or adjusting incentive programs. Early token holders will have the ability to participate in a liquidity mining program (e.g., the UNN Geyser) and a Voluntary Lockup Arrangement, where bolstered incentives beyond the UNN Geyser will be granted based on fixed lock-up periods.

Details for both the UNN Geyser and the Voluntary Lockup Arrangement are in the next section.

There will be a fixed total of 1 billion UNN tokens. The UNN token forms the basis for the UNION governance ecosystem, driving the incentives for liquidity programs, yield farming, and protection coverage across the platform. The token distribution is as follows:

  • 300 million UNN provisioned for the network, ecosystem, and marketing. Examples include bug bounties, audits, community outreach, development of future products, etc.
  • 93.333 million UNN have been sold in the first round, sold at $0.0075 / UNN
  • 200 million UNN have been sold in the second round, sold at $0.0125 / UNN
  • 43.2million UNN have been sold in the third round, sold at $0.025 / UNN
  • 113.467 million UNN reserved for the treasury.
  • 50 million UNN provisioned for the November 22, 2020, UNN sale.
  • 200 million UNN reserved for the team and advisors.

Token Locks

UNN is a governance token with aspirations of being a staple DeFi token that is included in major DeFi platforms and protocols because of its deep liquidity. To achieve this we designed and built the project from the beginning to have no mechanical locks (meaning they can be moved freely by holders) beyond delivery following the public sale. Early investors have been selected based on their long term vision with the project.

UNN + pUNN + uUNN

The UNN token is the centrepiece of a multi-layered system using a three-token model.

The complimentary tokens to UNN are uUNN and pUNN. The UNN token is the only token available during the token sale. The uUNN and pUNN tokens are used within the capital and pricing model of the platform to divest the governance process from the UNION asset protection model. Separate tokens for governance and protection remove conflicts of interest arising from complex market dynamics, such as those experienced by NXM in September 2020.

Buyers of protection will receive uUNN tokens giving them rights to the protection policy. Writers of protection will receive pUNN tokens, representing the % share of the protection pool that the writer is powering. The pUNN protection pool incentives program is detailed below following the Voluntary Lockup Arrangement section.

More details about how the multi-layered governance, claims assessment, and protection process function are available in the official UNION whitepaper.

Geyser

To incentivize on-chain liquidity providers of UNN (starting with Uniswap), we will provide a UNN Geyser. We built the UNN Geyser using AmpleForth’s Geyser as our base reference.

  1. The program will run for 1 year with 3-month phases.
  2. Phase 1: 20% APY for 3 months.
  3. Phase 2: To be announced.
  4. The launch pairs on Uniswap will be UNN/ETH and UNN/USDT.

Voluntary Lock Incentive

This lock incentivizes community participation in the early days of the project when governance activity is expected to be muted. In other words, think of this as a precursor to our governance incentive program. In return for locking their tokens, during which they cannot exchange/send the tokens, the purchasers will receive enhanced UNN incentives above the Geyser amount.

  1. The lock-up periods will be instituted based on limited windows of specific days. Locks will be offered in 30, 60, and 120-day terms. The goal is to encourage the alignment of incentives for the community in the long-term.
  2. The locked balance will be compounded daily and deposited (locked) into their wallet.
  3. Once the lock is lifted, the purchaser can exchange/send all tokens (original amount + interest).

A comparison of the different lock returns along with Geyser (no lock) for theoretical 1,000 token allocations.

Image for post

Protection Writer (pUNN) Incentives

Image for post

In return for providing the liquidity for a protection pool, protection writers or pUNN tokens holders will receive for their particular protection pool:

  1. 80% of the premium paid by buyers of protection offered by the protection pool. pUNN will receive the payment in whatever the premium token is. As per the whitepaper, we will start with accepting payments in stablecoin, like DAI.
  2. The remaining 20% will be split as such:
  3. 10% to a reserve fund for the pool, aka the “rainy day” fund.
  4. 10% as foundation operations and reserve pool.
  5. 50% of returns generated by reserve pools for the pUNN’s protection pool, which hold funds in excess of MCR for said pool.
  6. The remaining 50% will be split as such:
  7. 40% deposited back into the reserve fund.
  8. 10% as foundation operations and reserve pool.

Future Incentives

  1. Governance: in tandem with our products going live, we will launch our governance incentive where UNN holders can lock their tokens for voting power and boosts to staking and/or pool returns.
  2. Lending: to launch in tandem with our OC protection. People can borrow against their UNN holdings.
  3. **Other: **we will run discretionary incentives and governance voted incentives as suited for the benefit of the ecosystem.

Roadmap

Our roadmap reflects a balance between meeting the pressing market demand for a suite of complete protection products and providing a robust and well-tested solution. Given the nature of the technology solution, we will always err on the side of conservative. Below roadmap is our aspirational target and may be adjusted based on progress and market demand.

Q4 2020:

  • UNN Sale
  • Private Development / Testing
  • DAO strategic partner acquisition

Q1 2021:

  • Production launch of Smart Contract Protection
  • Production launch of Over-Collateralization
  • Lending program
  • Governance launch

Q2 2021:

  • Production launch of Single-Step Gas Protection
  • Production launch of Layer-1 Protection (Protocol TBD)
  • A secondary market for protection

H2 2021 and beyond:

  • Advanced models for better risk management/pricing
  • New Protection: Multi-Step Gas, Bundled, Impermanent Loss (IL)
  • Expanded Protection coverage
  • Protection / Risk Management analytics for institutional entities

UNION Best Practices & Compliance Standards

Our general thesis on best practices is to deploy a series of state-of-the-art checks to ensure regulatory compliance and the privacy of user data. This begins even before our token sale, with our initial sales.

Steps that we take to ensure compliance and user data privacy include:

  • Thorough KYC/AML is required for all UNN sales. This prevents wallet taint from funding not recorded on the blockchain prior to the token sale as well as validating purchasers during the token sale. UNN utilizes the expertise of 3rd party KYC/AML parties to ensure the security and sanctity of funds received.
  • Multi-Vector KYC/AML — Liveliness, documents, and mobile phone verification. We utilize a 4-point check, including liveliness, which detects whether the person in front of the camera is real or just a picture of a picture.
  • GDPR Compliance — We maintain data using Cayman Data Retention Standards with the view to meeting GDPR compliance requirements. We have invested heavily in network and server security infrastructure, including intrusion detection, and system/network monitoring for all systems that collect and maintain purchaser data as part of the sale.
  • Geofencing — Our token sales site is geofenced, meaning that connection (and participation) to the sale are blocked from restricted countries. These include:
  • The United States of America, The Cayman Islands, New Zealand, Canada, People’s Republic of China, Singapore, Republic of Korea, Belarus, Burundi, Central African Republic, Cuba, Iran, Libya, Democratic People’s Republic of Korea, Somalia, Sudan, Darfur, Mali, Republic of Guinea, Republic of Guinea-Bissau, Syria, Venezuela, and Zimbabwe
  • Wallet Check — We vet wallets to ensure they are not a party to illicit activity.

Would you like to earn token right now! ☞ CLICK HERE

Looking for more information…

☞ Website
☞ Explorer
☞ Source Code
☞ Social Channel
☞ Documentation
☞ Coinmarketcap

Create an Account and Trade NOW

Bittrex
Poloniex
Binance

Thank for visiting and reading this article! I’m highly appreciate your actions! Please share if you liked it!

#bockchain #bitcoin #union finance #unn

Angelina roda

Angelina roda

1624219980

How to Buy NFT Art Finance Token - The EASIEST METHOD! DO NOT MISS!!! JUST IN 4 MINUTES

NFT Art Finance is currently one of the most popular cryptocurrencies right now on the market, so in today’s video, I will be showing you guys how to easily buy NFT Art Finance on your phone using the Trust Wallet application.
📺 The video in this post was made by More LimSanity
The origin of the article: https://www.youtube.com/watch?v=sKE6Pc_w1IE
🔺 DISCLAIMER: The article is for information sharing. The content of this video is solely the opinions of the speaker who is not a licensed financial advisor or registered investment advisor. Not investment advice or legal advice.
Cryptocurrency trading is VERY risky. Make sure you understand these risks and that you are responsible for what you do with your money
🔥 If you’re a beginner. I believe the article below will be useful to you ☞ What You Should Know Before Investing in Cryptocurrency - For Beginner
⭐ ⭐ ⭐The project is of interest to the community. Join to Get free ‘GEEK coin’ (GEEKCASH coin)!
☞ **-----CLICK HERE-----**⭐ ⭐ ⭐
Thanks for visiting and watching! Please don’t forget to leave a like, comment and share!

#bitcoin #blockchain #nft art finance token #token #buy nft art finance #how to buy nft art finance token - the easiest method!

David mr

David mr

1624312800

SPORE FINANCE PREDICTION - WHAT IS SPORE FINANCE & SPORE FINANCE ANALYSIS - SPORE FINANCE

SPORE FINANCE PREDICTION - WHAT IS SPORE FINANCE & SPORE FINANCE ANALYSIS - SPORE FINANCE

In this video, I talk about spore finance coin and give my spore finance prediction. I talk about the latest spore finance analysis & spore finance crypto coin that recently has been hit pretty hard in the last 24 hours. I go over what is spore finance and how many holders are on this new crypto coin spore finance.
📺 The video in this post was made by Josh’s Finance
The origin of the article: https://www.youtube.com/watch?v=qbPQvdxCtEI
🔺 DISCLAIMER: The article is for information sharing. The content of this video is solely the opinions of the speaker who is not a licensed financial advisor or registered investment advisor. Not investment advice or legal advice.
Cryptocurrency trading is VERY risky. Make sure you understand these risks and that you are responsible for what you do with your money
🔥 If you’re a beginner. I believe the article below will be useful to you ☞ What You Should Know Before Investing in Cryptocurrency - For Beginner
⭐ ⭐ ⭐The project is of interest to the community. Join to Get free ‘GEEK coin’ (GEEKCASH coin)!
☞ **-----CLICK HERE-----**⭐ ⭐ ⭐
Thanks for visiting and watching! Please don’t forget to leave a like, comment and share!

#bitcoin #blockchain #spore finance #what is spore finance #spore finance prediction - what is spore finance & spore finance analysis - spore finance #spore finance prediction

Royce  Reinger

Royce Reinger

1658068560

WordsCounted: A Ruby Natural Language Processor

WordsCounted

We are all in the gutter, but some of us are looking at the stars.

-- Oscar Wilde

WordsCounted is a Ruby NLP (natural language processor). WordsCounted lets you implement powerful tokensation strategies with a very flexible tokeniser class.

Features

  • Out of the box, get the following data from any string or readable file, or URL:
    • Token count and unique token count
    • Token densities, frequencies, and lengths
    • Char count and average chars per token
    • The longest tokens and their lengths
    • The most frequent tokens and their frequencies.
  • A flexible way to exclude tokens from the tokeniser. You can pass a string, regexp, symbol, lambda, or an array of any combination of those types for powerful tokenisation strategies.
  • Pass your own regexp rules to the tokeniser if you prefer. The default regexp filters special characters but keeps hyphens and apostrophes. It also plays nicely with diacritics (UTF and unicode characters): Bayrūt is treated as ["Bayrūt"] and not ["Bayr", "ū", "t"], for example.
  • Opens and reads files. Pass in a file path or a url instead of a string.

Installation

Add this line to your application's Gemfile:

gem 'words_counted'

And then execute:

$ bundle

Or install it yourself as:

$ gem install words_counted

Usage

Pass in a string or a file path, and an optional filter and/or regexp.

counter = WordsCounted.count(
  "We are all in the gutter, but some of us are looking at the stars."
)

# Using a file
counter = WordsCounted.from_file("path/or/url/to/my/file.txt")

.count and .from_file are convenience methods that take an input, tokenise it, and return an instance of WordsCounted::Counter initialized with the tokens. The WordsCounted::Tokeniser and WordsCounted::Counter classes can be used alone, however.

API

WordsCounted

WordsCounted.count(input, options = {})

Tokenises input and initializes a WordsCounted::Counter object with the resulting tokens.

counter = WordsCounted.count("Hello Beirut!")

Accepts two options: exclude and regexp. See Excluding tokens from the analyser and Passing in a custom regexp respectively.

WordsCounted.from_file(path, options = {})

Reads and tokenises a file, and initializes a WordsCounted::Counter object with the resulting tokens.

counter = WordsCounted.from_file("hello_beirut.txt")

Accepts the same options as .count.

Tokeniser

The tokeniser allows you to tokenise text in a variety of ways. You can pass in your own rules for tokenisation, and apply a powerful filter with any combination of rules as long as they can boil down into a lambda.

Out of the box the tokeniser includes only alpha chars. Hyphenated tokens and tokens with apostrophes are considered a single token.

#tokenise([pattern: TOKEN_REGEXP, exclude: nil])

tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise

# With `exclude`
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise(exclude: "hello")

# With `pattern`
tokeniser = WordsCounted::Tokeniser.new("I <3 Beirut!").tokenise(pattern: /[a-z]/i)

See Excluding tokens from the analyser and Passing in a custom regexp for more information.

Counter

The WordsCounted::Counter class allows you to collect various statistics from an array of tokens.

#token_count

Returns the token count of a given string.

counter.token_count #=> 15

#token_frequency

Returns a sorted (unstable) two-dimensional array where each element is a token and its frequency. The array is sorted by frequency in descending order.

counter.token_frequency

[
  ["the", 2],
  ["are", 2],
  ["we",  1],
  # ...
  ["all", 1]
]

#most_frequent_tokens

Returns a hash where each key-value pair is a token and its frequency.

counter.most_frequent_tokens

{ "are" => 2, "the" => 2 }

#token_lengths

Returns a sorted (unstable) two-dimentional array where each element contains a token and its length. The array is sorted by length in descending order.

counter.token_lengths

[
  ["looking", 7],
  ["gutter",  6],
  ["stars",   5],
  # ...
  ["in",      2]
]

#longest_tokens

Returns a hash where each key-value pair is a token and its length.

counter.longest_tokens

{ "looking" => 7 }

#token_density([ precision: 2 ])

Returns a sorted (unstable) two-dimentional array where each element contains a token and its density as a float, rounded to a precision of two. The array is sorted by density in descending order. It accepts a precision argument, which must be a float.

counter.token_density

[
  ["are",     0.13],
  ["the",     0.13],
  ["but",     0.07 ],
  # ...
  ["we",      0.07 ]
]

#char_count

Returns the char count of tokens.

counter.char_count #=> 76

#average_chars_per_token([ precision: 2 ])

Returns the average char count per token rounded to two decimal places. Accepts a precision argument which defaults to two. Precision must be a float.

counter.average_chars_per_token #=> 4

#uniq_token_count

Returns the number of unique tokens.

counter.uniq_token_count #=> 13

Excluding tokens from the tokeniser

You can exclude anything you want from the input by passing the exclude option. The exclude option accepts a variety of filters and is extremely flexible.

  1. A space-delimited string. The filter will normalise the string.
  2. A regular expression.
  3. A lambda.
  4. A symbol that names a predicate method. For example :odd?.
  5. An array of any combination of the above.
tokeniser =
  WordsCounted::Tokeniser.new(
    "Magnificent! That was magnificent, Trevor."
  )

# Using a string
tokeniser.tokenise(exclude: "was magnificent")
# => ["that", "trevor"]

# Using a regular expression
tokeniser.tokenise(exclude: /trevor/)
# => ["magnificent", "that", "was", "magnificent"]

# Using a lambda
tokeniser.tokenise(exclude: ->(t) { t.length < 4 })
# => ["magnificent", "that", "magnificent", "trevor"]

# Using symbol
tokeniser = WordsCounted::Tokeniser.new("Hello! محمد")
tokeniser.tokenise(exclude: :ascii_only?)
# => ["محمد"]

# Using an array
tokeniser = WordsCounted::Tokeniser.new(
  "Hello! اسماءنا هي محمد، كارولينا، سامي، وداني"
)
tokeniser.tokenise(
  exclude: [:ascii_only?, /محمد/, ->(t) { t.length > 6}, "و"]
)
# => ["هي", "سامي", "وداني"]

Passing in a custom regexp

The default regexp accounts for letters, hyphenated tokens, and apostrophes. This means twenty-one is treated as one token. So is Mohamad's.

/[\p{Alpha}\-']+/

You can pass your own criteria as a Ruby regular expression to split your string as desired.

For example, if you wanted to include numbers, you can override the regular expression:

counter = WordsCounted.count("Numbers 1, 2, and 3", pattern: /[\p{Alnum}\-']+/)
counter.tokens
#=> ["numbers", "1", "2", "and", "3"]

Opening and reading files

Use the from_file method to open files. from_file accepts the same options as .count. The file path can be a URL.

counter = WordsCounted.from_file("url/or/path/to/file.text")

Gotchas

A hyphen used in leu of an em or en dash will form part of the token. This affects the tokeniser algorithm.

counter = WordsCounted.count("How do you do?-you are well, I see.")
counter.token_frequency

[
  ["do",   2],
  ["how",  1],
  ["you",  1],
  ["-you", 1], # WTF, mate!
  ["are",  1],
  # ...
]

In this example -you and you are separate tokens. Also, the tokeniser does not include numbers by default. Remember that you can pass your own regular expression if the default behaviour does not fit your needs.

A note on case sensitivity

The program will normalise (downcase) all incoming strings for consistency and filters.

Roadmap

Ability to open URLs

def self.from_url
  # open url and send string here after removing html
end

Are you using WordsCounted to do something interesting? Please tell me about it.

Gem Version 

RubyDoc documentation.

Demo

Visit this website for one example of what you can do with WordsCounted.


Contributors

See contributors.

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

Author: Abitdodgy
Source Code: https://github.com/abitdodgy/words_counted 
License: MIT license

#ruby #nlp 

Words Counted: A Ruby Natural Language Processor.

WordsCounted

We are all in the gutter, but some of us are looking at the stars.

-- Oscar Wilde

WordsCounted is a Ruby NLP (natural language processor). WordsCounted lets you implement powerful tokensation strategies with a very flexible tokeniser class.

Are you using WordsCounted to do something interesting? Please tell me about it.

 

Demo

Visit this website for one example of what you can do with WordsCounted.

Features

  • Out of the box, get the following data from any string or readable file, or URL:
    • Token count and unique token count
    • Token densities, frequencies, and lengths
    • Char count and average chars per token
    • The longest tokens and their lengths
    • The most frequent tokens and their frequencies.
  • A flexible way to exclude tokens from the tokeniser. You can pass a string, regexp, symbol, lambda, or an array of any combination of those types for powerful tokenisation strategies.
  • Pass your own regexp rules to the tokeniser if you prefer. The default regexp filters special characters but keeps hyphens and apostrophes. It also plays nicely with diacritics (UTF and unicode characters): Bayrūt is treated as ["Bayrūt"] and not ["Bayr", "ū", "t"], for example.
  • Opens and reads files. Pass in a file path or a url instead of a string.

Installation

Add this line to your application's Gemfile:

gem 'words_counted'

And then execute:

$ bundle

Or install it yourself as:

$ gem install words_counted

Usage

Pass in a string or a file path, and an optional filter and/or regexp.

counter = WordsCounted.count(
  "We are all in the gutter, but some of us are looking at the stars."
)

# Using a file
counter = WordsCounted.from_file("path/or/url/to/my/file.txt")

.count and .from_file are convenience methods that take an input, tokenise it, and return an instance of WordsCounted::Counter initialized with the tokens. The WordsCounted::Tokeniser and WordsCounted::Counter classes can be used alone, however.

API

WordsCounted

WordsCounted.count(input, options = {})

Tokenises input and initializes a WordsCounted::Counter object with the resulting tokens.

counter = WordsCounted.count("Hello Beirut!")

Accepts two options: exclude and regexp. See Excluding tokens from the analyser and Passing in a custom regexp respectively.

WordsCounted.from_file(path, options = {})

Reads and tokenises a file, and initializes a WordsCounted::Counter object with the resulting tokens.

counter = WordsCounted.from_file("hello_beirut.txt")

Accepts the same options as .count.

Tokeniser

The tokeniser allows you to tokenise text in a variety of ways. You can pass in your own rules for tokenisation, and apply a powerful filter with any combination of rules as long as they can boil down into a lambda.

Out of the box the tokeniser includes only alpha chars. Hyphenated tokens and tokens with apostrophes are considered a single token.

#tokenise([pattern: TOKEN_REGEXP, exclude: nil])

tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise

# With `exclude`
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise(exclude: "hello")

# With `pattern`
tokeniser = WordsCounted::Tokeniser.new("I <3 Beirut!").tokenise(pattern: /[a-z]/i)

See Excluding tokens from the analyser and Passing in a custom regexp for more information.

Counter

The WordsCounted::Counter class allows you to collect various statistics from an array of tokens.

#token_count

Returns the token count of a given string.

counter.token_count #=> 15

#token_frequency

Returns a sorted (unstable) two-dimensional array where each element is a token and its frequency. The array is sorted by frequency in descending order.

counter.token_frequency

[
  ["the", 2],
  ["are", 2],
  ["we",  1],
  # ...
  ["all", 1]
]

#most_frequent_tokens

Returns a hash where each key-value pair is a token and its frequency.

counter.most_frequent_tokens

{ "are" => 2, "the" => 2 }

#token_lengths

Returns a sorted (unstable) two-dimentional array where each element contains a token and its length. The array is sorted by length in descending order.

counter.token_lengths

[
  ["looking", 7],
  ["gutter",  6],
  ["stars",   5],
  # ...
  ["in",      2]
]

#longest_tokens

Returns a hash where each key-value pair is a token and its length.

counter.longest_tokens

{ "looking" => 7 }

#token_density([ precision: 2 ])

Returns a sorted (unstable) two-dimentional array where each element contains a token and its density as a float, rounded to a precision of two. The array is sorted by density in descending order. It accepts a precision argument, which must be a float.

counter.token_density

[
  ["are",     0.13],
  ["the",     0.13],
  ["but",     0.07 ],
  # ...
  ["we",      0.07 ]
]

#char_count

Returns the char count of tokens.

counter.char_count #=> 76

#average_chars_per_token([ precision: 2 ])

Returns the average char count per token rounded to two decimal places. Accepts a precision argument which defaults to two. Precision must be a float.

counter.average_chars_per_token #=> 4

#uniq_token_count

Returns the number of unique tokens.

counter.uniq_token_count #=> 13

Excluding tokens from the tokeniser

You can exclude anything you want from the input by passing the exclude option. The exclude option accepts a variety of filters and is extremely flexible.

  1. A space-delimited string. The filter will normalise the string.
  2. A regular expression.
  3. A lambda.
  4. A symbol that names a predicate method. For example :odd?.
  5. An array of any combination of the above.
tokeniser =
  WordsCounted::Tokeniser.new(
    "Magnificent! That was magnificent, Trevor."
  )

# Using a string
tokeniser.tokenise(exclude: "was magnificent")
# => ["that", "trevor"]

# Using a regular expression
tokeniser.tokenise(exclude: /trevor/)
# => ["magnificent", "that", "was", "magnificent"]

# Using a lambda
tokeniser.tokenise(exclude: ->(t) { t.length < 4 })
# => ["magnificent", "that", "magnificent", "trevor"]

# Using symbol
tokeniser = WordsCounted::Tokeniser.new("Hello! محمد")
tokeniser.tokenise(exclude: :ascii_only?)
# => ["محمد"]

# Using an array
tokeniser = WordsCounted::Tokeniser.new(
  "Hello! اسماءنا هي محمد، كارولينا، سامي، وداني"
)
tokeniser.tokenise(
  exclude: [:ascii_only?, /محمد/, ->(t) { t.length > 6}, "و"]
)
# => ["هي", "سامي", "وداني"]

Passing in a custom regexp

The default regexp accounts for letters, hyphenated tokens, and apostrophes. This means twenty-one is treated as one token. So is Mohamad's.

/[\p{Alpha}\-']+/

You can pass your own criteria as a Ruby regular expression to split your string as desired.

For example, if you wanted to include numbers, you can override the regular expression:

counter = WordsCounted.count("Numbers 1, 2, and 3", pattern: /[\p{Alnum}\-']+/)
counter.tokens
#=> ["numbers", "1", "2", "and", "3"]

Opening and reading files

Use the from_file method to open files. from_file accepts the same options as .count. The file path can be a URL.

counter = WordsCounted.from_file("url/or/path/to/file.text")

Gotchas

A hyphen used in leu of an em or en dash will form part of the token. This affects the tokeniser algorithm.

counter = WordsCounted.count("How do you do?-you are well, I see.")
counter.token_frequency

[
  ["do",   2],
  ["how",  1],
  ["you",  1],
  ["-you", 1], # WTF, mate!
  ["are",  1],
  # ...
]

In this example -you and you are separate tokens. Also, the tokeniser does not include numbers by default. Remember that you can pass your own regular expression if the default behaviour does not fit your needs.

A note on case sensitivity

The program will normalise (downcase) all incoming strings for consistency and filters.

Roadmap

Ability to open URLs

def self.from_url
  # open url and send string here after removing html
end

Contributors

See contributors.

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request

Author: abitdodgy
Source code: https://github.com/abitdodgy/words_counted
License: MIT license

#ruby  #ruby-on-rails