1615254558
Bogged Finance is a token which runs code each time it is executed for various use-cases, including real-time and historical decentralised oracles on Binance Smart Chain.
BOG is the utility token that is used for:
An introduction to $BOG, BogTools and the Bogged Ecosystem.
The weak should fear the strong. (Part of our Bogged ARG)
The Bogged Token
The Bogged Token, $BOG, and associated contract are the key to the Bogged Ecosystem. All components of the Bogged ecosystem are unlocked through use of the $BOG Token.
Tokenomics:
Initial Supply of 2,500,000 BOG.
0.1% of all transactions are burnt, decreasing total supply.
4.4% is redistributed to LP Stakers, increasing TVL.
Initial liquidity has been locked for 6 Months minimum.
Once the BogTools begin shipping and the ecosystem is growing organically based on use of BogTools, the transaction fees mentioned above will be removed, and the staker rewards will be replaced with fees from the usage and deployment of BogTools.
Tokenomics
Fixed total supply of 2,500,000 BOG
There currently is a 4.5% fee on every transaction, this is split as follows:
Initial Distribution
There was no presale, and no tokens were reserved for the dev team.
The Initial Liquidity has been locked in a contract for a period of 6 months. View Contract.
Staking
Providing BOG-BNB liquidity on Pancakeswap will give you BOG-BNB Cake-LP tokens which you can stake to earn a share of the 4.4% transaction fee distributed to stakers. In the future it will also entitle you to a portion of Bogdabot Deployment and Maintenance fees. See the Staking section for more info.
Governance
While many tokens claim to be community run, Bogged has the rails built in to enable community governance. The BogTools Fees are modifiable and once enough holders are reached we will move to using a Governance Contract that will allow the token holders to vote on any changes.
BogTools (formerly referred to as Bogdabots)
BogTools are deployable individual contracts on BSC that connect to hooks built into the Bogged Transaction System, allowing for code to be executed as transactions are made with $BOG.
As $BOG transaction volume increases the BogTools contracts will get closer and closer to running on the blockchain 24/7 - BOG is unique in that it is the first ever implementation of such a system.
BogTools can currently only be created by the team. However, there will be interfaces built to the BogTools Standard that allow deploying BogTools to perform tasks on the blockchain, similar to Uniswap/Pancakeswap pair factories.
The first such publicly deployable BogTool is On-Chain Historical Price Oracles.
BOG launched with 2,500,000 coins with a deflationary model to begin with, to discourage swing trading. The launch was a fair IDO with 50 BNB and all 2,500,000 BOG locked in Pancakeswap for 6 months.
Bogdabots are individual contracts deployed on BSC that connect to hooks built into the Bogged Transaction System, allowing code to be executed as transactions are made with $BOG.
As $BOG transaction volume increases the Bogdabot contracts will get closer and closer to running on the blockchain 24/7 - BOG is unique in that it is the first seen implementation of this, partially due to the low gas fees allowed by BSC.
The first product is BogCharts which will be targeted towards developers of brand new coins who are not able to use any centralised services for price charting.
This will take the form of a deployable On-Chain Bogdabot Oracle to monitor the price of their token, paired with a UI package for charting the oracle data that they are able to self-host on their token website.
These deployable oracles will cost an ongoing $BOG fee to use, depending on price accumulation period and other factors. This fee will be fed back into the BOG ecosystem as rewards for holders/stakers.
Oracles on Bogged allow you to get any information on the Binance Smart Chain, in an on-chain, decentralised and frequently updated manner.
This is achieved by deploying a BogTool contract which is able to record and store the price of any asset-pair whenever a $BOG transaction occurs.
They can then be used to access reliable current & historical data.
We have a simple implementation of this on our website in the form of a price tracker. This also leads us into our next formal product announcement, BogCharts.
As all our early adopters know, it is very hard to get a price chart early on in a tokens lifetime, despite it being potentially crucial for the confidence of the early investors. The main BSC charting websites are centralized, not always up to date, and very often require you to hold a substantial balance of their tokens to view the charts.
The first BogTool product we are announcing to make use of our On-Chain price oracles is BogCharts which will be targeted towards developers of brand new coins who are not able to use any centralised services for price charting.
This will take the form of a deployable On-Chain BogTool Oracle to monitor the price of their token, paired with a UI package for charting the oracle data that they are able to self-host on their token website.
These deployable oracles will cost an ongoing $BOG fee to use, depending on price accumulation period and other factors. This fee will be fed back into the BOG ecosystem.
Allowing contracts to effectively run constantly on the blockchain without any off-chain input allows for some interesting fully-decentralised possibilities.
One example of this would be limit orders and stop losses for Pancakeswap.
Another would be automatically tethering up when whales move large amounts into exchanges.
Our current focus will be expanding on the Oracles and allowing people to deploy their own, and we can’t wait to see what people will build upon these. After this we will visit some of the other possibilities.
In addition to the BogTools Ecosystem, we continue to build out the Entertainment portion of the Bogged ARG. With new upgrades, gamification, and more to keep users and holders engaged with the protocol.
Stage 2 for the Bogged ARG involves the release of the Sminem Token. Users will be able to claim Sminem by staking their Bogged LP for 24 hours, from the Sminem Pool.
The Sminem Release will take advantage of BogTools by removing the existing $NGMI BogTools contract to replace it with a new $NGMI contract which fits into the new rules of the Bogged ARG.
Sminem allows users to avoid receiving $NGMI when selling $BOG on pancakeswap, a useless curse-token that is untransferable. This is done at risk of losing the Sminem to death (burnt) or the pool, where he will be claimed by another user.
We feel the ARG is an exciting and innovative way of maintaining our userbase as we deploy our technology to the Binance Smart Chain. The Bogged ARG is an optional part of the Bogged ecosystem, that we are still very much committed to. The ARG incentivises users to make transactions in $BOG which is beneficial to the ecosystem as more transactions provide more computing power to the BogTools contracts.
Would you like to earn TOKEN right now! ☞ CLICK HERE
BOG has been listed on a number of crypto exchanges, unlike other main cryptocurrencies, it cannot be directly purchased with fiats money. However, You can still easily buy this coin by first buying Bitcoin, ETH, USDT from any large exchanges and then transfer to the exchange that offers to trade this coin, in this guide article we will walk you through in detail the steps to buy BOG
You will have to first buy one of the major cryptocurrencies, usually either Bitcoin (BTC), Ethereum (ETH), Tether (USDT)…
We will use Binance Exchange here as it is one of the largest crypto exchanges that accept fiat deposits.
Binance is a popular cryptocurrency exchange which was started in China but then moved their headquarters to the crypto-friendly Island of Malta in the EU. Binance is popular for its crypto to crypto exchange services. Binance exploded onto the scene in the mania of 2017 and has since gone on to become the top crypto exchange in the world.
Once you finished the KYC process. You will be asked to add a payment method. Here you can either choose to provide a credit/debit card or use a bank transfer, and buy one of the major cryptocurrencies, usually either Bitcoin (BTC), Ethereum (ETH), Tether (USDT)
Step by Step Guide : What is Binance | How to Create an account on Binance (Updated 2021)
Next step - Transfer your cryptos to an Altcoin Exchange
Since BOG is an altcoin we need to transfer our coins to an exchange that BOG can be traded. Below is a list of exchanges that offers to trade BOG in various market pairs, head to their websites and register for an account.
Once finished you will then need to make a BTC/ETH/USDT deposit to the exchange from Binance depending on the available market pairs. After the deposit is confirmed you may then purchase BOG from the exchange.
Exchange: Pancakeswap
Apart from the exchange(s) above, there are a few popular crypto exchanges where they have decent daily trading volumes and a huge user base. This will ensure you will be able to sell your coins at any time and the fees will usually be lower. It is suggested that you also register on these exchanges since once BOG gets listed there it will attract a large amount of trading volumes from the users there, that means you will be having some great trading opportunities!
Top exchanges for token-coin trading. Follow instructions and make unlimited money
☞ https://www.binance.com
☞ https://www.bittrex.com
☞ https://www.poloniex.com
☞ https://www.bitfinex.com
☞ https://www.huobi.com
☞ https://www.mxc.ai
☞ https://www.probit.com
☞ https://www.gate.io
☞ https://www.coinbase.com
Find more information BOG
☞ Website ☞ Explorer ☞ Social Channel ☞ Social Channel 2 ☞ Coinmarketcap
I hope this post will help you. If you liked this, please sharing it with others. Thank you!
#cryptocurrency #bitcoin #bogged finance #bog
1615254558
Bogged Finance is a token which runs code each time it is executed for various use-cases, including real-time and historical decentralised oracles on Binance Smart Chain.
BOG is the utility token that is used for:
An introduction to $BOG, BogTools and the Bogged Ecosystem.
The weak should fear the strong. (Part of our Bogged ARG)
The Bogged Token
The Bogged Token, $BOG, and associated contract are the key to the Bogged Ecosystem. All components of the Bogged ecosystem are unlocked through use of the $BOG Token.
Tokenomics:
Initial Supply of 2,500,000 BOG.
0.1% of all transactions are burnt, decreasing total supply.
4.4% is redistributed to LP Stakers, increasing TVL.
Initial liquidity has been locked for 6 Months minimum.
Once the BogTools begin shipping and the ecosystem is growing organically based on use of BogTools, the transaction fees mentioned above will be removed, and the staker rewards will be replaced with fees from the usage and deployment of BogTools.
Tokenomics
Fixed total supply of 2,500,000 BOG
There currently is a 4.5% fee on every transaction, this is split as follows:
Initial Distribution
There was no presale, and no tokens were reserved for the dev team.
The Initial Liquidity has been locked in a contract for a period of 6 months. View Contract.
Staking
Providing BOG-BNB liquidity on Pancakeswap will give you BOG-BNB Cake-LP tokens which you can stake to earn a share of the 4.4% transaction fee distributed to stakers. In the future it will also entitle you to a portion of Bogdabot Deployment and Maintenance fees. See the Staking section for more info.
Governance
While many tokens claim to be community run, Bogged has the rails built in to enable community governance. The BogTools Fees are modifiable and once enough holders are reached we will move to using a Governance Contract that will allow the token holders to vote on any changes.
BogTools (formerly referred to as Bogdabots)
BogTools are deployable individual contracts on BSC that connect to hooks built into the Bogged Transaction System, allowing for code to be executed as transactions are made with $BOG.
As $BOG transaction volume increases the BogTools contracts will get closer and closer to running on the blockchain 24/7 - BOG is unique in that it is the first ever implementation of such a system.
BogTools can currently only be created by the team. However, there will be interfaces built to the BogTools Standard that allow deploying BogTools to perform tasks on the blockchain, similar to Uniswap/Pancakeswap pair factories.
The first such publicly deployable BogTool is On-Chain Historical Price Oracles.
BOG launched with 2,500,000 coins with a deflationary model to begin with, to discourage swing trading. The launch was a fair IDO with 50 BNB and all 2,500,000 BOG locked in Pancakeswap for 6 months.
Bogdabots are individual contracts deployed on BSC that connect to hooks built into the Bogged Transaction System, allowing code to be executed as transactions are made with $BOG.
As $BOG transaction volume increases the Bogdabot contracts will get closer and closer to running on the blockchain 24/7 - BOG is unique in that it is the first seen implementation of this, partially due to the low gas fees allowed by BSC.
The first product is BogCharts which will be targeted towards developers of brand new coins who are not able to use any centralised services for price charting.
This will take the form of a deployable On-Chain Bogdabot Oracle to monitor the price of their token, paired with a UI package for charting the oracle data that they are able to self-host on their token website.
These deployable oracles will cost an ongoing $BOG fee to use, depending on price accumulation period and other factors. This fee will be fed back into the BOG ecosystem as rewards for holders/stakers.
Oracles on Bogged allow you to get any information on the Binance Smart Chain, in an on-chain, decentralised and frequently updated manner.
This is achieved by deploying a BogTool contract which is able to record and store the price of any asset-pair whenever a $BOG transaction occurs.
They can then be used to access reliable current & historical data.
We have a simple implementation of this on our website in the form of a price tracker. This also leads us into our next formal product announcement, BogCharts.
As all our early adopters know, it is very hard to get a price chart early on in a tokens lifetime, despite it being potentially crucial for the confidence of the early investors. The main BSC charting websites are centralized, not always up to date, and very often require you to hold a substantial balance of their tokens to view the charts.
The first BogTool product we are announcing to make use of our On-Chain price oracles is BogCharts which will be targeted towards developers of brand new coins who are not able to use any centralised services for price charting.
This will take the form of a deployable On-Chain BogTool Oracle to monitor the price of their token, paired with a UI package for charting the oracle data that they are able to self-host on their token website.
These deployable oracles will cost an ongoing $BOG fee to use, depending on price accumulation period and other factors. This fee will be fed back into the BOG ecosystem.
Allowing contracts to effectively run constantly on the blockchain without any off-chain input allows for some interesting fully-decentralised possibilities.
One example of this would be limit orders and stop losses for Pancakeswap.
Another would be automatically tethering up when whales move large amounts into exchanges.
Our current focus will be expanding on the Oracles and allowing people to deploy their own, and we can’t wait to see what people will build upon these. After this we will visit some of the other possibilities.
In addition to the BogTools Ecosystem, we continue to build out the Entertainment portion of the Bogged ARG. With new upgrades, gamification, and more to keep users and holders engaged with the protocol.
Stage 2 for the Bogged ARG involves the release of the Sminem Token. Users will be able to claim Sminem by staking their Bogged LP for 24 hours, from the Sminem Pool.
The Sminem Release will take advantage of BogTools by removing the existing $NGMI BogTools contract to replace it with a new $NGMI contract which fits into the new rules of the Bogged ARG.
Sminem allows users to avoid receiving $NGMI when selling $BOG on pancakeswap, a useless curse-token that is untransferable. This is done at risk of losing the Sminem to death (burnt) or the pool, where he will be claimed by another user.
We feel the ARG is an exciting and innovative way of maintaining our userbase as we deploy our technology to the Binance Smart Chain. The Bogged ARG is an optional part of the Bogged ecosystem, that we are still very much committed to. The ARG incentivises users to make transactions in $BOG which is beneficial to the ecosystem as more transactions provide more computing power to the BogTools contracts.
Would you like to earn TOKEN right now! ☞ CLICK HERE
BOG has been listed on a number of crypto exchanges, unlike other main cryptocurrencies, it cannot be directly purchased with fiats money. However, You can still easily buy this coin by first buying Bitcoin, ETH, USDT from any large exchanges and then transfer to the exchange that offers to trade this coin, in this guide article we will walk you through in detail the steps to buy BOG
You will have to first buy one of the major cryptocurrencies, usually either Bitcoin (BTC), Ethereum (ETH), Tether (USDT)…
We will use Binance Exchange here as it is one of the largest crypto exchanges that accept fiat deposits.
Binance is a popular cryptocurrency exchange which was started in China but then moved their headquarters to the crypto-friendly Island of Malta in the EU. Binance is popular for its crypto to crypto exchange services. Binance exploded onto the scene in the mania of 2017 and has since gone on to become the top crypto exchange in the world.
Once you finished the KYC process. You will be asked to add a payment method. Here you can either choose to provide a credit/debit card or use a bank transfer, and buy one of the major cryptocurrencies, usually either Bitcoin (BTC), Ethereum (ETH), Tether (USDT)
Step by Step Guide : What is Binance | How to Create an account on Binance (Updated 2021)
Next step - Transfer your cryptos to an Altcoin Exchange
Since BOG is an altcoin we need to transfer our coins to an exchange that BOG can be traded. Below is a list of exchanges that offers to trade BOG in various market pairs, head to their websites and register for an account.
Once finished you will then need to make a BTC/ETH/USDT deposit to the exchange from Binance depending on the available market pairs. After the deposit is confirmed you may then purchase BOG from the exchange.
Exchange: Pancakeswap
Apart from the exchange(s) above, there are a few popular crypto exchanges where they have decent daily trading volumes and a huge user base. This will ensure you will be able to sell your coins at any time and the fees will usually be lower. It is suggested that you also register on these exchanges since once BOG gets listed there it will attract a large amount of trading volumes from the users there, that means you will be having some great trading opportunities!
Top exchanges for token-coin trading. Follow instructions and make unlimited money
☞ https://www.binance.com
☞ https://www.bittrex.com
☞ https://www.poloniex.com
☞ https://www.bitfinex.com
☞ https://www.huobi.com
☞ https://www.mxc.ai
☞ https://www.probit.com
☞ https://www.gate.io
☞ https://www.coinbase.com
Find more information BOG
☞ Website ☞ Explorer ☞ Social Channel ☞ Social Channel 2 ☞ Coinmarketcap
I hope this post will help you. If you liked this, please sharing it with others. Thank you!
#cryptocurrency #bitcoin #bogged finance #bog
1624219980
NFT Art Finance is currently one of the most popular cryptocurrencies right now on the market, so in today’s video, I will be showing you guys how to easily buy NFT Art Finance on your phone using the Trust Wallet application.
📺 The video in this post was made by More LimSanity
The origin of the article: https://www.youtube.com/watch?v=sKE6Pc_w1IE
🔺 DISCLAIMER: The article is for information sharing. The content of this video is solely the opinions of the speaker who is not a licensed financial advisor or registered investment advisor. Not investment advice or legal advice.
Cryptocurrency trading is VERY risky. Make sure you understand these risks and that you are responsible for what you do with your money
🔥 If you’re a beginner. I believe the article below will be useful to you ☞ What You Should Know Before Investing in Cryptocurrency - For Beginner
⭐ ⭐ ⭐The project is of interest to the community. Join to Get free ‘GEEK coin’ (GEEKCASH coin)!
☞ **-----CLICK HERE-----**⭐ ⭐ ⭐
Thanks for visiting and watching! Please don’t forget to leave a like, comment and share!
#bitcoin #blockchain #nft art finance token #token #buy nft art finance #how to buy nft art finance token - the easiest method!
1624312800
SPORE FINANCE PREDICTION - WHAT IS SPORE FINANCE & SPORE FINANCE ANALYSIS - SPORE FINANCE
In this video, I talk about spore finance coin and give my spore finance prediction. I talk about the latest spore finance analysis & spore finance crypto coin that recently has been hit pretty hard in the last 24 hours. I go over what is spore finance and how many holders are on this new crypto coin spore finance.
📺 The video in this post was made by Josh’s Finance
The origin of the article: https://www.youtube.com/watch?v=qbPQvdxCtEI
🔺 DISCLAIMER: The article is for information sharing. The content of this video is solely the opinions of the speaker who is not a licensed financial advisor or registered investment advisor. Not investment advice or legal advice.
Cryptocurrency trading is VERY risky. Make sure you understand these risks and that you are responsible for what you do with your money
🔥 If you’re a beginner. I believe the article below will be useful to you ☞ What You Should Know Before Investing in Cryptocurrency - For Beginner
⭐ ⭐ ⭐The project is of interest to the community. Join to Get free ‘GEEK coin’ (GEEKCASH coin)!
☞ **-----CLICK HERE-----**⭐ ⭐ ⭐
Thanks for visiting and watching! Please don’t forget to leave a like, comment and share!
#bitcoin #blockchain #spore finance #what is spore finance #spore finance prediction - what is spore finance & spore finance analysis - spore finance #spore finance prediction
1659601560
We are all in the gutter, but some of us are looking at the stars.
-- Oscar Wilde
WordsCounted is a Ruby NLP (natural language processor). WordsCounted lets you implement powerful tokensation strategies with a very flexible tokeniser class.
Are you using WordsCounted to do something interesting? Please tell me about it.
Visit this website for one example of what you can do with WordsCounted.
["Bayrūt"]
and not ["Bayr", "ū", "t"]
, for example.Add this line to your application's Gemfile:
gem 'words_counted'
And then execute:
$ bundle
Or install it yourself as:
$ gem install words_counted
Pass in a string or a file path, and an optional filter and/or regexp.
counter = WordsCounted.count(
"We are all in the gutter, but some of us are looking at the stars."
)
# Using a file
counter = WordsCounted.from_file("path/or/url/to/my/file.txt")
.count
and .from_file
are convenience methods that take an input, tokenise it, and return an instance of WordsCounted::Counter
initialized with the tokens. The WordsCounted::Tokeniser
and WordsCounted::Counter
classes can be used alone, however.
WordsCounted.count(input, options = {})
Tokenises input and initializes a WordsCounted::Counter
object with the resulting tokens.
counter = WordsCounted.count("Hello Beirut!")
Accepts two options: exclude
and regexp
. See Excluding tokens from the analyser and Passing in a custom regexp respectively.
WordsCounted.from_file(path, options = {})
Reads and tokenises a file, and initializes a WordsCounted::Counter
object with the resulting tokens.
counter = WordsCounted.from_file("hello_beirut.txt")
Accepts the same options as .count
.
The tokeniser allows you to tokenise text in a variety of ways. You can pass in your own rules for tokenisation, and apply a powerful filter with any combination of rules as long as they can boil down into a lambda.
Out of the box the tokeniser includes only alpha chars. Hyphenated tokens and tokens with apostrophes are considered a single token.
#tokenise([pattern: TOKEN_REGEXP, exclude: nil])
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise
# With `exclude`
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise(exclude: "hello")
# With `pattern`
tokeniser = WordsCounted::Tokeniser.new("I <3 Beirut!").tokenise(pattern: /[a-z]/i)
See Excluding tokens from the analyser and Passing in a custom regexp for more information.
The WordsCounted::Counter
class allows you to collect various statistics from an array of tokens.
#token_count
Returns the token count of a given string.
counter.token_count #=> 15
#token_frequency
Returns a sorted (unstable) two-dimensional array where each element is a token and its frequency. The array is sorted by frequency in descending order.
counter.token_frequency
[
["the", 2],
["are", 2],
["we", 1],
# ...
["all", 1]
]
#most_frequent_tokens
Returns a hash where each key-value pair is a token and its frequency.
counter.most_frequent_tokens
{ "are" => 2, "the" => 2 }
#token_lengths
Returns a sorted (unstable) two-dimentional array where each element contains a token and its length. The array is sorted by length in descending order.
counter.token_lengths
[
["looking", 7],
["gutter", 6],
["stars", 5],
# ...
["in", 2]
]
#longest_tokens
Returns a hash where each key-value pair is a token and its length.
counter.longest_tokens
{ "looking" => 7 }
#token_density([ precision: 2 ])
Returns a sorted (unstable) two-dimentional array where each element contains a token and its density as a float, rounded to a precision of two. The array is sorted by density in descending order. It accepts a precision
argument, which must be a float.
counter.token_density
[
["are", 0.13],
["the", 0.13],
["but", 0.07 ],
# ...
["we", 0.07 ]
]
#char_count
Returns the char count of tokens.
counter.char_count #=> 76
#average_chars_per_token([ precision: 2 ])
Returns the average char count per token rounded to two decimal places. Accepts a precision argument which defaults to two. Precision must be a float.
counter.average_chars_per_token #=> 4
#uniq_token_count
Returns the number of unique tokens.
counter.uniq_token_count #=> 13
You can exclude anything you want from the input by passing the exclude
option. The exclude option accepts a variety of filters and is extremely flexible.
:odd?
.tokeniser =
WordsCounted::Tokeniser.new(
"Magnificent! That was magnificent, Trevor."
)
# Using a string
tokeniser.tokenise(exclude: "was magnificent")
# => ["that", "trevor"]
# Using a regular expression
tokeniser.tokenise(exclude: /trevor/)
# => ["magnificent", "that", "was", "magnificent"]
# Using a lambda
tokeniser.tokenise(exclude: ->(t) { t.length < 4 })
# => ["magnificent", "that", "magnificent", "trevor"]
# Using symbol
tokeniser = WordsCounted::Tokeniser.new("Hello! محمد")
tokeniser.tokenise(exclude: :ascii_only?)
# => ["محمد"]
# Using an array
tokeniser = WordsCounted::Tokeniser.new(
"Hello! اسماءنا هي محمد، كارولينا، سامي، وداني"
)
tokeniser.tokenise(
exclude: [:ascii_only?, /محمد/, ->(t) { t.length > 6}, "و"]
)
# => ["هي", "سامي", "وداني"]
The default regexp accounts for letters, hyphenated tokens, and apostrophes. This means twenty-one is treated as one token. So is Mohamad's.
/[\p{Alpha}\-']+/
You can pass your own criteria as a Ruby regular expression to split your string as desired.
For example, if you wanted to include numbers, you can override the regular expression:
counter = WordsCounted.count("Numbers 1, 2, and 3", pattern: /[\p{Alnum}\-']+/)
counter.tokens
#=> ["numbers", "1", "2", "and", "3"]
Use the from_file
method to open files. from_file
accepts the same options as .count
. The file path can be a URL.
counter = WordsCounted.from_file("url/or/path/to/file.text")
A hyphen used in leu of an em or en dash will form part of the token. This affects the tokeniser algorithm.
counter = WordsCounted.count("How do you do?-you are well, I see.")
counter.token_frequency
[
["do", 2],
["how", 1],
["you", 1],
["-you", 1], # WTF, mate!
["are", 1],
# ...
]
In this example -you
and you
are separate tokens. Also, the tokeniser does not include numbers by default. Remember that you can pass your own regular expression if the default behaviour does not fit your needs.
The program will normalise (downcase) all incoming strings for consistency and filters.
def self.from_url
# open url and send string here after removing html
end
See contributors.
git checkout -b my-new-feature
)git commit -am 'Add some feature'
)git push origin my-new-feature
)Author: abitdodgy
Source code: https://github.com/abitdodgy/words_counted
License: MIT license
#ruby #ruby-on-rails
1658068560
WordsCounted
We are all in the gutter, but some of us are looking at the stars.
-- Oscar Wilde
WordsCounted is a Ruby NLP (natural language processor). WordsCounted lets you implement powerful tokensation strategies with a very flexible tokeniser class.
["Bayrūt"]
and not ["Bayr", "ū", "t"]
, for example.Add this line to your application's Gemfile:
gem 'words_counted'
And then execute:
$ bundle
Or install it yourself as:
$ gem install words_counted
Pass in a string or a file path, and an optional filter and/or regexp.
counter = WordsCounted.count(
"We are all in the gutter, but some of us are looking at the stars."
)
# Using a file
counter = WordsCounted.from_file("path/or/url/to/my/file.txt")
.count
and .from_file
are convenience methods that take an input, tokenise it, and return an instance of WordsCounted::Counter
initialized with the tokens. The WordsCounted::Tokeniser
and WordsCounted::Counter
classes can be used alone, however.
WordsCounted.count(input, options = {})
Tokenises input and initializes a WordsCounted::Counter
object with the resulting tokens.
counter = WordsCounted.count("Hello Beirut!")
Accepts two options: exclude
and regexp
. See Excluding tokens from the analyser and Passing in a custom regexp respectively.
WordsCounted.from_file(path, options = {})
Reads and tokenises a file, and initializes a WordsCounted::Counter
object with the resulting tokens.
counter = WordsCounted.from_file("hello_beirut.txt")
Accepts the same options as .count
.
The tokeniser allows you to tokenise text in a variety of ways. You can pass in your own rules for tokenisation, and apply a powerful filter with any combination of rules as long as they can boil down into a lambda.
Out of the box the tokeniser includes only alpha chars. Hyphenated tokens and tokens with apostrophes are considered a single token.
#tokenise([pattern: TOKEN_REGEXP, exclude: nil])
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise
# With `exclude`
tokeniser = WordsCounted::Tokeniser.new("Hello Beirut!").tokenise(exclude: "hello")
# With `pattern`
tokeniser = WordsCounted::Tokeniser.new("I <3 Beirut!").tokenise(pattern: /[a-z]/i)
See Excluding tokens from the analyser and Passing in a custom regexp for more information.
The WordsCounted::Counter
class allows you to collect various statistics from an array of tokens.
#token_count
Returns the token count of a given string.
counter.token_count #=> 15
#token_frequency
Returns a sorted (unstable) two-dimensional array where each element is a token and its frequency. The array is sorted by frequency in descending order.
counter.token_frequency
[
["the", 2],
["are", 2],
["we", 1],
# ...
["all", 1]
]
#most_frequent_tokens
Returns a hash where each key-value pair is a token and its frequency.
counter.most_frequent_tokens
{ "are" => 2, "the" => 2 }
#token_lengths
Returns a sorted (unstable) two-dimentional array where each element contains a token and its length. The array is sorted by length in descending order.
counter.token_lengths
[
["looking", 7],
["gutter", 6],
["stars", 5],
# ...
["in", 2]
]
#longest_tokens
Returns a hash where each key-value pair is a token and its length.
counter.longest_tokens
{ "looking" => 7 }
#token_density([ precision: 2 ])
Returns a sorted (unstable) two-dimentional array where each element contains a token and its density as a float, rounded to a precision of two. The array is sorted by density in descending order. It accepts a precision
argument, which must be a float.
counter.token_density
[
["are", 0.13],
["the", 0.13],
["but", 0.07 ],
# ...
["we", 0.07 ]
]
#char_count
Returns the char count of tokens.
counter.char_count #=> 76
#average_chars_per_token([ precision: 2 ])
Returns the average char count per token rounded to two decimal places. Accepts a precision argument which defaults to two. Precision must be a float.
counter.average_chars_per_token #=> 4
#uniq_token_count
Returns the number of unique tokens.
counter.uniq_token_count #=> 13
You can exclude anything you want from the input by passing the exclude
option. The exclude option accepts a variety of filters and is extremely flexible.
:odd?
.tokeniser =
WordsCounted::Tokeniser.new(
"Magnificent! That was magnificent, Trevor."
)
# Using a string
tokeniser.tokenise(exclude: "was magnificent")
# => ["that", "trevor"]
# Using a regular expression
tokeniser.tokenise(exclude: /trevor/)
# => ["magnificent", "that", "was", "magnificent"]
# Using a lambda
tokeniser.tokenise(exclude: ->(t) { t.length < 4 })
# => ["magnificent", "that", "magnificent", "trevor"]
# Using symbol
tokeniser = WordsCounted::Tokeniser.new("Hello! محمد")
tokeniser.tokenise(exclude: :ascii_only?)
# => ["محمد"]
# Using an array
tokeniser = WordsCounted::Tokeniser.new(
"Hello! اسماءنا هي محمد، كارولينا، سامي، وداني"
)
tokeniser.tokenise(
exclude: [:ascii_only?, /محمد/, ->(t) { t.length > 6}, "و"]
)
# => ["هي", "سامي", "وداني"]
The default regexp accounts for letters, hyphenated tokens, and apostrophes. This means twenty-one is treated as one token. So is Mohamad's.
/[\p{Alpha}\-']+/
You can pass your own criteria as a Ruby regular expression to split your string as desired.
For example, if you wanted to include numbers, you can override the regular expression:
counter = WordsCounted.count("Numbers 1, 2, and 3", pattern: /[\p{Alnum}\-']+/)
counter.tokens
#=> ["numbers", "1", "2", "and", "3"]
Use the from_file
method to open files. from_file
accepts the same options as .count
. The file path can be a URL.
counter = WordsCounted.from_file("url/or/path/to/file.text")
A hyphen used in leu of an em or en dash will form part of the token. This affects the tokeniser algorithm.
counter = WordsCounted.count("How do you do?-you are well, I see.")
counter.token_frequency
[
["do", 2],
["how", 1],
["you", 1],
["-you", 1], # WTF, mate!
["are", 1],
# ...
]
In this example -you
and you
are separate tokens. Also, the tokeniser does not include numbers by default. Remember that you can pass your own regular expression if the default behaviour does not fit your needs.
The program will normalise (downcase) all incoming strings for consistency and filters.
def self.from_url
# open url and send string here after removing html
end
Are you using WordsCounted to do something interesting? Please tell me about it.
Visit this website for one example of what you can do with WordsCounted.
Contributors
See contributors.
git checkout -b my-new-feature
)git commit -am 'Add some feature'
)git push origin my-new-feature
)Author: Abitdodgy
Source Code: https://github.com/abitdodgy/words_counted
License: MIT license